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America’s choice 


The values of scientific enquiry, rather than any particular policy positions on science, suggest a 
preference for one US presidential candidate over the other. 


crossroads, but the choice to be made on 4 November feels 
unusual, and daunting, in its national and global significance. 

Science and the research enterprise offer powerful tools for address- 
ing key challenges that face America and the world, and it is hearten- 
ing that both John McCain and Barack Obama have had thoughtful 
things to say about them. Obama has been more forthcoming in his 
discussion of research goals (see Nature 455, 446-449; 2008), but 
both have engaged with the issues. McCain deserves particular credit 
for taking a stance on carbon emissions that is at odds with that of a 
significant proportion of his party. 

There is no qpen-and-shut case for preferring one man or the other 
on the basis of their views on these matters. This is as t should be: for 
science to be a narrow sectional interest bundled up in a sirgle party 
would be a terrible thing. Both sides recognize science’s inspirational 
value and ability to help achieve national and global goals. That is 
common ground to be prized, and a scientific journal's discussion of 
these matters might be expected to stop right there. 

But science is bound by, and committed to, a set of normative 
values — values that have application to political questions. Placing 
a disinterested view of the world as it is ahead of our views of how it 
should be; recognizing that ideas should be tested in as systematic a 
way as possible; appreciating that there are experts whose views and 
criticisms need to be taken seriously: these are all attributes of good 
science that can be usefully applied when making decisions about the 
world of which science is but a part. Writ larger, the core values of 
science are those of open debate within a free society that have come 
down to us from the Enlightenment in many forms, not the least of 
which is the constitution of the United States. 

On a range of topics, science included, Obama has surrounded 
himself with a wider and more able cadre of advisers than McCain. 
This is not a panacea. Some of the policies Obama supports — 
continued subsidies for corn ethanol, for example — seem misguided. 


Te election of a US president almost always seems like a 


The advice of experts is all the more valuable when it is diverse: 
‘groupthink is a problem in any job. Obama seems to understands 
this. He tends to seek a range of opinions and analyses to ensure 
that his own opinion, when reached, has been well considered and 
exposed to alternatives. He also exhibits pragmatism — for example 
in his proposals for health-care reform — that suggests a keen sense 
for the tests reality can bring to bear “The core values of 


on policy. : h f 
Some will find strengths in McCain science are t —? 
open debate within a 


that they value more highly than the 
commitment to reasoned assessment free society that have 
come down to us from 


that appeals in Obama. But all the signs 

are pee former seeks a narrower 4h, Enlightenment.” 
range of advice. Equally worrying is 
that he fails to educate himself on crucial matters; the attitude he 
has taken to economic policy over many years is at issue here. Either 
as a result of poor advice, or of advice inadequately considered, 
he frequently makes decisions that seem capricious or erratic. The 
most notable of these is his ill-considered choice of Sarah Palin, the 
Republican governor of Alaska, as running mate. Palin lacks the 
experience, and any outward sign of the capacity, to face the rigours 
of the presidency. 

The Oval Office is not a debating chamber, nor is it a faculty club. 
As anyone in academia will know, a thoughtful and professorial air 
is not in itself a recommendation for executive power. But a commit- 
ment to seeking good advice and taking seriously the findings of 
disinterested enquiry seems an attractive attribute for a chief execu- 
tive. It certainly matters more than any specific pledge to fund some 
particular agency or initiative at a certain level — pledges of a sort now 
largely rendered moot by the unpredictable flux of the economy. 

This journal does not have a vote, and does not claim any particular 
standing from which to instruct those who do. But if it did, it would 
cast its vote for Barack Obama. 2 


Growing stronger 


Science in developing countries can withstand 
the current economic climate. 


ith the news dominated by failing banks and falling stock 
VV markets, it is easy to forget that some indices are moving 

upwards — such as those that record the strength of sci- 
ence, technology and innovation in developing nations. Despite the 
turmoil, this is likely to continue. 


Snapshots of this trend are captured this week in. A World of Science 
in the Developing World, a publication by TWAS, the academy of 


sciences for the developing world, which celebrates its 25th anniver- 
sary next month (copies are distributed with this issue to subscrib- 
ers, and are freely available at www.nature.com/twas). In the past 
25 years, it is not just the larger countries such as Brazil, China, India 
and Mexico where the volume and the quality of scientific research 
have been transformed. Smaller countries such as Chile, Malaysia, 
Rwanda and Vietnam all regard investing in new knowledge, technol- 
ogy and higher education as national priorities. 

The reasons are not hard to spot: most countries of the devel- 
oped world invest 2% or more of their national incomes on research 
and development (R&D). At the same time, they enjoy some of the 
highest standards of living in the world. The link between R&D 
spending and national wealth is subject to much debate among 
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social scientists (see, for example, D. Edgerton Nature 455, 1030- 
1031; 2008). But many funders and science policy-makers see the 
relationship in more straightforward terms: more science eventually 
provides valuable social returns, including greater wealth, education 
and a skilled workforce. 

Even the oil-producing states of the Middle East — which have 
enjoyed good living standards without spending on science and tech- 
nology — are coming round to this view. The largest-ever university- 
building programme in the region’s history is now under way, financed 
for the most part by the extra income from high oil prices. 

As the price of oil (and other commodities) drops along with con- 
fidence in the markets, the strategy of investing more in science is 
going to be tested to the hilt. The larger developing countries should 
be able to weather the storm — in part because the state is a selatively 
larger player in research, compared with more developed countries. If 
private-sector R&D begins to dedine, it should still be possible for the 
state to maintain investment at — or near — aurrent levels. 

Smaller (and poorer) countries will have a harder time. Many are 


already borrowing from institutions such as the International Monetary 
Fund (IMF), and their priority will be to increase spending on welfare 
programmes. But those that want to stay on the present path have more 
grounds for hope than was the case in past economic crises. 

In the 1980s, for example, countries that borrowed from the IMF 
and from the World Bank were told to cut back on government 
spending. Science and higher education were two of the biggest casu- 
alties, and in a £w countries the effects can still be seen. In 2008, the 
mood couldn't be more different. In their response to the financial 
crisis, many developed world governments are talking not of cut- 
ting back, but of rebuilding and strengthening public institutions. In 
addition, there is a consensus that the world’s financial architecture 
needs reforming too. Given such an environment, it will be difficult 
to justify asking poorer countries to cut back on their knowledge 
infrastructure when richer countries will be doing the opposite. 

There is no doubt that the next few years will be difficult, but for 
the countries of the developing world, the answer to their present 
problems is not to undo the gains of the past quarter-century. 7 


A risk worth taking 


An experiment by the Gates Foundation is food for 
thought for other research agencies. 


bacterium Helicobacter pylori was involved in gastritis and pep- 

tic ulcers met with ingrained resistance from the guardians of 
stomach-acid wisdom in the 1980s. Against the odds, the two pre- 
vailed, revolutionizing care of the disease and receiving a Nobel prize 
in 2005. But how many other potentially groundbreaking ideas are 
dragged down a dark alley and quietly strangled by overly conserva- 
tive peer review of grant proposals? 

Research funding should strive for a balanced portfolio that indudes 
both safe investments and higher-risk work. While the world’s financial 
system has been inflated with wildly excessive risk, research funding 
has had the opposite problem — exacerbated by ever greater com- 
petition for limited funds, it is overly wedded to safe, unadventurous 
research. This, in effect, ostracizes off-the-wall ideas, which often cross 
disciplinary boundaries and would have potentially big payoffs should 
they work. Researchers long ago learned that the last people they should 
tell about their big ideas are their sources of financial support. 

To be fair, there are exceptions to such conservatism. The US 
National Institutes of Health, for example, has systematically promoted 
risky research through several initiatives (see http://nihroadmap. 
nih.gov/highrisk), although the jury is out on its scientific impact. 
Military imperatives for innovation led the US Defense Advanced 
Research Projects Agency to transgress the conventions of academic 
business-as-usual, with undisputed success. 

In a similar vein, frustrated with repeatedly encountering the same 
faces from the same countries, and a narrow range of ideas, the Bill & 
Melinda Gates Foundation decided to cast the net wider in he search 
for new people and ideas. Last week, it announced the 104 winners 
of the first round of its five-year, US$100-million ‘Grand Challenges 


B arry Marshall and Robin Warren's unorthodox idea that the 
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Explorations’ programme. This solicited unconventional ideas for 
protecting against infectious diseases, limiting drug resistance and 
exploring latent tuberculosis. Future rounds will include ideas for 
vaccines for killer diseases and tools to help eradicate malaria. 

It is the nature of the call for proposals, and their peer review, that is 
intriguing. The grant proposal is one that many researchers can only 
dream of — a two-page explanation of the idea, with no supporting 
data required. ‘To emphasize that it’s the idea that matters, reviewers 
were blinded to the name, profession and nationality of the applicant. 

The reviewers themselves were atypical. Instead of consensus 
review by experts in the field — as is he norm — the 4,000-odd pro- 
posals received were sent to individuals, not just in science but also in 
engineering, business and beyond — people the foundation considers 
to be champions’ with strong track records in high-risk research. 

It is too soon to assess the projects’ chances of success, but what 
is clear is that many of the projects and their proponents are very 
different from those in conventional global-health research. They 
include work to explore links between natural HIV-resistance and 
genetic markers for type 2 diabetes, a proposal from an astrophysicist 
to reduce transmission of malaria with a flashlight that would diso- 
rientate mosquitoes, and testing the bacterium Bdellovibrio, which 
predates on many Gram-negative pathogens, as a living antibiotic. 

The first-round grants, at $100,000, are small. Dishing out large 
sums of money on far-fetched ideas would be foolhardy, given that 
as many as nine out of ten of these projects are expected to fail. But 
those that show signs of success will be eligible for further funds of 
$1 million, or much more. All told, however, the entire programme 
amounts to just 10% of the Gates Foundation’s new Global Health 
Discovery wing, which itself accounts for just 20% of the organiza- 
tion’s overall global-health research budget. 

Such levels of investment seem sensibly aligned with levels of risk. 
Risk-taking brings new faces and ideas to the table at reasonable cost 
and spurs creativity. Research agencies everywhere need to take a 
hard look at their funding portfolios to ensure that they are investing 
sufficiently in high-risk and potentially transformative research. m 
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Turn of the screw 


Phys. Rev. Lett. 101, 178102 (2008) 

The tubular cells of the aquatic plant Chara corallina are 
several centimetres long and about 1 millimetre wide — too 
big for diffusion to effectively disperse nutrients within 
them, but too small for the pumping circulatory systems of 
large organisms. So the cytoplasm of such a cell circulates 
by advection. This is driven by the movement of myosin 
motor proteins along ‘tracks’ of the filamentous protein 
actin on the cell's inner walls. But the details of how this 
works have been unclear. 

Jan-Willem van de Meent and his colleagues at the 
University of Cambridge, UK, now present a model in 
which advection by the myosin ‘conveyor belts’ generates 
two counterflowing, helically entwined vortices. These mix 


and homogenize solutes much faster than would diffusion. 
The idea could be copied in microfluidic systems, in which | 


generating rapid mixing is difficult. 


Hot and bothered 


Proc. Natl Acad. Sci. USA doi:10.1073/ 
pnas.0806830105 (2008) 

Like many spiders, Erigone atra, which tends 
to live among crops, has a choice to make 
when it decides to move. It can either spin a 
silk sail to ‘balloon’ a relatively long way, or 
spin a rope and rappel a shorter distance. 

A team led by Dries Bonte of Ghent 
University in Belgium has discovered that 
the temperature that E. atra experiences 
during development influences the means of 
transport it chooses. The researchers raised 
520 spiders in Petri dishes and then placed 
them on platforms in a wind tunnel. Spiders 
raised under cool, spring-like conditions 
tended to balloon, whereas those raised in 
summer-like conditions preferred to rappel. 


Stemming senescence 


Cell 135, 227-239 (2008) 
Researchers have uncovered a biochemical 
pathway involving tumour suppressor 
proteins that changes the expression of genes 
in neural stem cells as the cells age. This casts 
the ebbing regenerative capacity of ageing 
tissues as a trade-off that keeps cancer at bay. 
Sean Morrison at the University of 
Michigan in Ann Arbor and his colleagues 
worked out that the decrease in Hmga2, a 
regulator of gene expression, that occurs 
as neural stem cells age allows levels of the 
tumour suppressors p16" and p19*" to 
increase. This leads to lower self-renewal rates. 
The researchers counted the number of neural 
cells per section of intestine (pictured right) 
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as one measure of self-renewal. 

Hmga2 promotes cell renewal in young 
mice by maintaining low tumour suppressor 
levels. In old mice, however, avoiding cancer 
takes precedence over keeping neural matter 
youthful. 


NO problem 


Nature Chem. Biol. doi:10.1038/nchembio.115 (2008) 
Nitric oxide (NO) is involved in a host 
mammalian signalling pathways, and several 
enzymes are charged with its synthesis. The 
overactivity of iNOS, one of these enzymes, 
has been linked to multiple sclerosis and 
Alzheimer’s disease. Now researchers have 
found a way to inhibit it. 

Elsa Garcin and Elizabeth Getzoff at 
the Scripps Research Institue in La Jolla, 
California, and their colleagues reasoned 
that the best approach to selectively block the 
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action of this enzyme would be to study the 
structures of two drugs that preferentially 
hinder iNOS over other NO-producing 
enzymes. They then designed a new iNOS 
inhibitor that anchors itself to a binding 
pocket on the enzyme before inducing a 
cascade of conformational changes that make 
the inhibitor’s grip more secure. 


Sound trap 


Nature Phys. doi:10.1038/nphys1101 (2008) 

Since the 1950s, physicists have known 

that a very disordered material can trap 
electrons. This is because of the electron’s 
wave behaviour. When the degree of disorder 
is sufficient, the electron waves can become 
‘localized’ in a single spot. 

This trapping behaviour is complex, but 
John Page at the University of Manitoba in 
Winnipeg, Canada, and his colleagues have 
created an analogous but simpler system for 
sound waves. They fused small aluminium 
ball-bearings together and, using ultrasound, 
were able to observe sound waves become 
trapped in the disordered aluminium 
structures. They believe that their work may 
improve the understanding of localization, 
which could have applications in electronics. 


Born to reign 


Science 322, 552 (2008) 

Harvester ants develop into queens or sterile 

workers according to their genes as well as 

their environment, which had been thought 

to be the lone determinant of female caste. 
Tanja Schwander, currently at Simon 


J.-W. VAN DE MEENT, M. POLIN, R. E. GOLDSTEIN/UNIV. CAMBRIDGE 


Fraser University in Burnaby, Canada, and 
Laurent Keller of the University of Lausanne in 
Switzerland noticed that some combinations 
of genes are more common in queen harvester 
ants (Pogonomyrmex rugosus) than in workers, 
and vice versa. So they crossed queens from six 
colonies with males of the same and different 
colonies to see what effect this might have. 

The duo found that certain cross-colony 
pairings — and therefore particular gene 
combinations — reliably skewed the 
proportion of queens to female workers 
compared with the average. 


NEUROSCIENCE 


Twitchy details 


Cell 135, 334-342 (2008); Neuron 60, 285-297 (2008) 
The connections between nerves and the 
muscles that enable animals to move and 
breathe cannot form without a protein called 
Agrin that activates an enzyme called MuSK. 
Two research groups have discovered that 
Agrin binds to Lrp4, a 
receptor, and together 
they switch on MuSK. 

Steven Burden and 
a team from New York 
University Medical 
School stimulated 
normal and Lrp4- 
deficient developing 
muscle fibres with 
Agrin. MuSK was only 
activated in cells with 
Lrp4. The team then infected the mutant cells 
with a virus carrying the Lrp4 gene, which 
caused these cells to start producing the 
receptor. Agrin could then turn on MuSK. 

Lin Mei, of the Johns Hopkins University 
School of Medicine in Baltimore, Maryland, 
and his co-workers linked Lrp4 expression to 
the clustering of neurotransmitter receptors at 
the junction of nerve and muscle cells. 


MOLECULAR BIOLOGY 


On the mend 


Science 322, 597-602 (2008) 
Some sorts of DNA damage tend to get 
repaired right at the edge of a cell's nucleus, 
Susan Gasser of the Friedrich Miescher 
Institute for Biomedical Research in Basel, 
Switzerland, and her colleagues have shown. 
They followed the process of DNA repair 
by inducing a DNA double-strand break and 
tracking fluorescent proteins tagged near the 
site of the break. Damaged DNA was shunted 
to the edge of the nucleus — to a subcomplex 
of the nuclear pore that facilitates the 
exchange of RNA and proteins between 
the cytoplasm and the cell nucleus. The 


transport involved repair proteins that are 
bound to a protein called SUMO, and are also 
recognized by a pore-associated ubiquitin 
ligase, a type of enzyme. 


ECOLOGY 
Cone heads 


Proc. R. Soc. B doi:10.1098/rspb.208.0742 (2008) 
The affinity of insects for bright lights is well 
known, but at least one species seems to ‘see 
the heat of seed cones (pictured below, right 
image taken with infra-red camera) from afar. 

Gerhard Gries of Simon Fraser University 
in Burnaby, Canada, and his colleagues have 
identified infra-red (IR) receptors on conifer 
seed bugs (Leptoglossus occidentalis), and 
shown with the aid of IR sources in the lab 
and the field that these creatures prefer hot 
things. The insects displayed a preference for 
traps that emitted strong rather than weak 
IR — unless their IR receptors were covered 
with silica paint. 


ASTRONOMY 


Planet probing 


Astrophys. J. 686, 1341-1348 (2008) 

A tiny bit of light from an extrasolar ‘hot 
Jupiter’ has allowed astronomers to speculate 
about the diversity of the atmospheres of 
similar planets. 

David Charbonneau of the Harvard- 
Smithsonian Center for Astrophysics in 
Cambridge, Massachusetts, and his colleagues 
used the Spitzer Space Telescope to observe 
a relatively nearby hot Jupiter before and 
after a Secondary transit: These occur as a 
planet goes behind a star. The number of 
photons detected before and after the eclipse 
is compared with the number detected during 
the eclipse, and the difference is the number 
that can be attributed to the planet itself. 

Charbonneau’s team discerned that 
the hot Jupiter emitted less light in three 
spectral bands associated with water than 
in surrounding bands, contrary to findings 
from another hot Jupiter. This suggests that 
there are at least two types of hot-Jupiter 
atmosphere: one that gets cooler with altitude, 
and another that has a hat layer high up. 
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RESEARCH HIGHLIGHTS 


JOURNAL CLUB 


Bartosz Grzybowski 
Northwestern University, 
Evanston, Illinois 


A physical chemist is pleased to 
learn that ‘microscale’ swimming 
isn't that hard after all. 


Even if small organisms perfectly 
mimicked gold medallist Michael 
Phelps's technique, they wouldn't 
win a microswimming Olympics. 
The viscosity of water is so high 
that these little fellows have had to 
develop some unusual swimming 
styles. In 1977, E. M. Purcell formally 
expressed this idea with his famous 
‘scallop theorem’. He showed that 
swimming forwards cannot be 
achieved at the micrometre-scale 
with ‘time-reversible’ motions 
such as the back-and-forth 
wiggling of a rigid tail. Instead, 
tiny organisms must use complex, 
asymmetrical strokes. 

But this is not always the 
case, according to engineers at 
the Massachusetts Institute of 
Technology in Cambridge and the 
University of California, San Diego. 
In July, they proved that time- 
reversible tail-wiggling or wing- 
flapping can be a viable mode of 
propulsion through a fluid, provided 
itis done next to a deformable 
interface such as a soft membrane 
(R. Trouilloud et al. Phys. Rev. Lett. 
101, 048102; 2008). The reversible 
motions of the swimmer coupleina 
nonlinear way to the deformations 
of the interface, producing 
additional flows and forces that are 
sufficient for locomotion. 

One of the most exciting 
extensions of this result might 
be in creating ‘nanosubmarines’ 
—amuch-criticized dream of 
nanotechnologists to have devices 
navigate blood vessels, finding 
and fixing damaged organs 
as they go. The idea has so far 
seemed implausible because such 
machines would need elaborate 
nanopropellers — which are 
prohibitively difficult to build — to 
sculpt asymmetrical swimming 
motions. But what about using a 
simpler propulsion mechanism 
and relying on the deformations 
of blood-vessel walls to move 
nanosubmarines along? Is there a 
nanoshipyard out there somewhere 
to put this idea to the test? 


Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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NEWS 


THE ELECTION LANDSCAPE 


Along with the US presidency, also up for grabs at the 4 November US elections 
are 470 congressional seats, 11 governor seats and a series of ballot initiatives. 
Here's a look at some of the races important to science. 


CALIFORNIA. 
Proposition 7 would require the state to 
produce 40% of its electricity from renewable 
sources — up from the current mandate of 
20% — by 2020. Environmental groups, 
however, say that the proposal allows too 
many loopholes for industry. 


COLORADO. 


Amendment 48 would change the definition of 


‘person’ to include human fertilized eggs, 


embryos and fetuses — potentially outlawing 
birth-control methods that interfere with the 


implantation of a fertilized egg. 


NEBRASKA 
Republican Mike Johanns, the former 
secretary of agriculture, is favoured to win 
his Senate race against Democrat rancher 
Scott Kleeb. 


- 


The retirement of long-time Republican senator 
Pete Domenici opens up a spot representing some 
of the country’s most powerful nuclear-weapons 
centres — Los Alamos National Laboratory and 
Sandia National Laboratories. 


MICHIGAN 
Proposal 2 would allow state researchers to 
derive new human embryonic stem-cell lines 
(see ‘Stem-cell law goes to the polls’, below). 


ILLINOIS. i> 


Bill Foster (Democrat), one of three 
physicists in the House of Representatives, 
is favoured to win over Republican 

businessman Jim Oberweis. 


MISSOURI 

Proposition C would require investor- 
owned utilities to generate 15% of their 
power from renewable sources by 2021. 


Stem-cell law goes to the polls 


to say no. According to Michigan law, 
the woman could donate the embryos 
to another state or throw them away — 
but not give them to a local researcher to 
derive new stem-cell lines. 

Michigan’s scientific reputation could 
change on 4 November. Voters will not 
only choose the next US president (see 
Nature 455, 442-453; 2008) but also say 
yes or no to a state ballot measure on 
stem-cell research, known as Proposal 2 
— one of several initiatives across the 
country (see map). If passed, the proposal 
would amend a 1978 state law banning 
research on live human embryos, which 
currently prevents Michigan researchers 
from deriving new human embryonic 
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Last year Sean Morrison, a 
stem-cell scientist at the Uni- 
versity of Michigan in Ann 
Arbor, received an e-mail 
from a woman wanting to 
donate extra em bryos 

from her in vitro fer- 

tilization procedure 
towards his research into 
Parkinson’s disease. He had 


stem-cell lines, and which they see as limiting 
their research. 

The initiative would also mean that, instead 
of having to discard embryos left over from 
fertility treatment, women could legally donate 
them directly to state stem-cell research cen- 
tres. The state’s ban on using somatic-cell 
nuclear transfer to produce a human embryo 
would still hold, and buying and selling human 
embryos would also become illegal. 


Bill Clinton (left) and Al Taubman back Michigan's Proposal 2. 
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Supporters of the initiative had first tried to 
work within the state legislature to change what 
they see as overly restrictive laws. But the bills 
never got out of committee, says Chris De Witt, 
spokesman for CureMichigan, the group that 
sponsors Proposal 2. Proponents tried to get a 
similar initiative on a ballot in 2006, but failed 
to collect the necessary number of signatures. 
It’s not clear whether the political tides have 
turned in the initiative’s favour this year; a 
recent poll by a local newspaper and tele- 
vision stations showed a roughly equal 
split between those who would vote for 
the measure and those who would not. 
It has, however, attracted high-profile 
supporters such as Michigan governor 
Jennifer Granholm, billionaire Al Taub- 
man and former president Bill Clinton. 

Opponents cite the destruction of 
human embryos as a reason to vote against 
the measure. Some groups are specifically 
worried about part D of the state ballot 
proposal, which dedares it would “prohibit 
state and local laws that prevent, restrict 
or discourage stem-cell research, future 
therapies and cures”. That, in essence, 
would allow future stem-cell research in 
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doomed Hitchcock's 
Sy kamikaze gulls 
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the state to go unregulated, argues state senator 
Tom George (Republican), who co-chairs the 
opposition group, Michigan Citizens Against 
Unrestricted Science and Experimentation. 

That’s simply not true, counters De Witt. 
He notes that state institutions that conduct 
human embryonic stem-cell research have 
internal ethics review boards to monitor the 
studies, and that researchers must also follow 
federal regulations regarding investigations 
into humans and human tissue. The same 
constraints would hold, he says, if researchers 
were allowed to derive their own lines. 

The initiative has “major significance” 
because presidential candidates John McCain 
and Barack Obama have both hinted that, if 
elected, they would loosen federal restrictions 
on embryonic stem-cell research, says Stephen 
Rapundalo, executive director of MichBio in 
Ann Arbor, a non-profit organization trying to 
drive growth of the life-sciences industry in the 
state. Currently, federal research money can be 
used for work only on cell lines detived before 
9 August 2001, 
the date the fed- 
eral restrictions 
came into effect. 
The University 


“We are already at 
a disadvantage in 
recruiting faculty 


who specialize in of Michigan uses 
h b ; 11 of these fed- 
uman embryonic Lailappioged 


stem-cell research.” _ lines, which were 
grown usin g 
mouse cells and which, they say, are less than 
ideal for human clinical research. 

If federal law changes after the election but 
state law does not, Rapundalo says, stem- 
cell scientists in the state will be at a distinct 
disadvantage. Many fear that leading research- 
ers will leave Michigan for other states that 
support the work, such as California and 
New York. Sue O'Shea, director of the Michi- 
gan Center for Human Embryonic Stem Cell 
Research in Ann Arbor, says she has seen 
many of her best students leave — especially 
those who want to work on embryonic rather 
than adult stem cells. 

Morrison agrees. “We have already been at 
a disadvantage in recruiting faculty members 
who specialize in the area of human embryonic 
stem-cell research,” he says. “If the proposal 
does not pass, this will not improve.” 

Nevertheless, unlike California’s $3-billion 
stem-cell agency, which was created through a 
ballot initiative, the Michigan initiative has no 
money attached to it. “So even if it does pass, it 
wont necessarily allow us to develop any new 
embryonic stem-cell lines; O'Shea says. “It will 
just make life easier knowing we can do it” m 
Ashley Yeager 
See Editorial, page 1149. 


Alzheimer's tests under fire 


Genetic testing for Alzheimer’s disease tells 


a cautionary tale about the legal, medical 
and ethical complications of personal 
genomics, as the story of a Pennsylvania 
company shows. 

Smart Genetics, based in Philadelphia, 
has stopped offering its controversial 
‘Alzheimer’s Mirror’ genetic test just 
eight months after introducing it. The 
test checked for variants in a gene, called 
APOE, that bestow as much as a 15-fold 
increased risk of developing Alzheimer’s. 
Soon after launching the test, though, 
Smart Genetics chief executive Julian 
Awad found himself in a controversy over 
whether it violated intellectual-property 
agreements covering APOE testing. 


Smart Genetics’ tests were performed by 


Athena Diagnostics, based in Worcester, 
Massachusetts. Athena 


Emory University School of Medicine in 
Atlanta, Georgia, notes that the firm faced 
another roadblock: Smart Genetics was 
charging hundreds of dollars for one test, 
whereas other gene-scanning firms offer 
Alzheimer’s risk assessments along with 
other tests. “The financial model was hard 
to support,’ he says. 

The tests offered by other firms bring 
issues of their own. For instance, Navigenics 
of Redwood Shores, California, provides 
Alzheimer’s risk assessments by testing 
variants of a gene called APOC1, which 
sits next to APOE on chromosome 19. 
Navigenics uses APOC] variants to predict 
APOE status on the basis of published 
reports that certain variants of the two 
genes are often inherited together. But 
APOC1 is not a perfect proxy for APOE 

variants. Navigenics chief 


had, in turn, licensed operating officer Sean 
the patents from Duke George says the company 
University in Durham, is switching to testing 
North Carolina, where APOE variants directly. 
researcher Allen Roses Roses thinks genetic 
discovered the APOE testing for the risk of 
link to Alzheimer’s in Alzheimer’s will only get 
the early 1990s. Roses more complex. He claims 
and Duke argue that to have unpublished data 
Athena's licence covers suggesting that variants 
APOE testing only in Can genes reliably predict riskforan in another gene canbe 
people who alreadyhave —_ Alzheimer's brain (left)? used with APOE variants 
symptoms of dementia. to predict, to within 


“The test was never intended to be used 
for wholesale screening of non-cognitively 


impaired individuals,” adds Alan Herosian, 


director of corporate alliances for Duke 
University. He says he has contacted 
Athena many times in recent months to 
press this point. 

Michael Henry, Athena’s vice-president 
of business development, wouldn't 
comment on whether the company agreed 
with this interpretation of its licence. But 
Smart Genetics is no longer taking new 
orders for Alzheimer’s Mirror. Its website 
says the test is currently unavailable 
because of “high demand”. The company’s 
phone lines have been disconnected, and a 
Philadelphia newspaper, the Philadelphia 
Business Journal, reported earlier this 
month that the company has closed. 

Smart Genetics co-founder Richard 
Watson would not comment on the 
newspaper article, and Awad did not 
respond to e-mails or phone messages. But 
one member of the company’s scientific 
advisory board, Andrew Faucett of 
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5-10 years, the age at which a person will 
develop Alzheimer’s disease. 

But he also hopes that ethical aspects of 
risk assessments will change if clinical 
trials identify drugs to treat the disease. 
Currently, knowing one’s risk of developing 
the disease may simply cause needless 
worty, as there is no prevention or treatment. 
But Roses notes that the firm for which he 
worked previously, GlaxoSmithKline based 
near London, has reported preliminary 
data that its drug rosiglitazone benefits 
patients with Alzheimer’s if they don’t carry 
ahigh-risk APOE variant (A. D. Roses 
Alzheimer’s Dementia 4, 164-166; 2008). 
The company is scheduled to finish clinical 
trials next year to test whether the findings 
hold up in larger numbers of patients with 
Alzheimer’s. 

If they do, Roses notes, it will mean APOE 
tests might be more useful, as they could 
help identify patients who will benefit 
most from treatment: “That’s where we are 
approaching very, very rapidly,’ he says. 
Erika Check Hayden 
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No burial for 10,000-year-old bones 


SAN DIEGO 

In the latest twist in the tug-of-war between 
Native Americans and anthropologists, officials 
at the University of California have decided 
not to repatriate a pair of well-preserved 
skeletons that are nearly 10,000 years old. 

Archaeology students unearthed the bones 
in 1976 near the clifftop home of the chancel- 
lor of the University of California, San Diego 
(UCSD). It may be possible to extract some of 
the oldest human DNA in North America from 
the exquisitely preserved remains, say research- 
ers. But in the past two years the bones have 
become a political football over US$7-million 
plans to demolish and rebuild the house. 

A group of 13 local bands, known as the 
Kumeyaay tribes, argued that the site was a 
sacred burial site, and that the bones found 
there should be repatriated to them. In March 
this year, UCSD dropped plans to knock down 
the house, opting instead for a renovation. But 
last week, University of California officials 
notified federal authorities that the bones could 
not be proved to be culturally affiliated with the 
Kumeyaay and thus would not be returned. 

Steve Banegas, a tribal spokesman for the 
Kumeyaay, says they hadn't been notified of 
the decision. “They are our relatives,” he says. 


“We want them reburied. They should stop 
playing politics with the remains.’ 

The dispute reflects the increasingly acrimo- 
nious debate over decisions involving ancient 
skeletons. In 2004, a £deral court ruled that the 
roughly 9,300-year-old Kennewick Man skel- 
eton, found in a tiverbank in Washington state, 
should not be returned to local tribes that could 
not prove cultural affiliation. In other cases, 
usually involving younger bones, museums 
have returned specimens when 
they were shown to be culturally 
affiliated to local tribes. 

In San Diego, tribes newly 
enriched by casino earnings 
have enlisted powerful state 
legislators to their cause. Facing 
such pressure, University of California officials 
are reviewing the 10-campus university’s policy 
on how cultural affiliation is determined. 

Currently, decisions about cultural affiliation 
are made by a panel of scientists — typically 
including a Native American — at each cam- 
pus. Campus actions are then reviewed by a 
nine-person University of California panel, 
which includes two Native Americans, before 
a final decision is reached. But in September, 
the office of Mark Yudof, the president of the 


“There are many 
points of view onthe 
cultural affiliation of 
the remains." 


University of California, initiated discussions 
about possibly eliminating the system-wide 
committee. 

Four prominent University of California 
anthropologists wrote a letter to Yudof on 
30 September, strenuously objecting to the 
proposed change. They indude Phillip Walker 
and Michael Glassow of the University of Cali- 
fornia, Santa Barbara; Robert Bettinger of the 
University of Californa, Davis; and Philip 
Wilke of the University of Cali- 
fornia, Riverside. “It is coun- 
terproductive to devolve final 
decision-making authority to 
the often inexperienced and 
legally ill-informed level of the 
local campus,’ says the letter. 

In an interview, Bettinger said that the 
system-wide panel serves as a vial form of peer 
review. “If the analysis is not rigorous, some- 
thing is missed or a campus drops the ball, the 
University of California system-wide panel 
can correct that,” he says. “This has happened 
a bunch of times.” For instance, in 2001 the 
system-wide committee overruled a decision 
by the University of California, Los Angeles, in 
which skeletons and funerary objects were rec- 
ommended for repatriation to the Kumeyaay. 


Gabon centre refocuses on emerging diseases 


The International Medical Research 
Centre (CIRMF) in the former 
French colony of Gabon wants to 
have something to celebrate on its 
AOth birthday next year. Many 
hope that the centre's newly 
appointed director-general, 
Jean-Paul Gonzalez, can lead the 
institution — which fell into disarray 
in the 1990s — toa position of 
international excellence in global 
health research. 

Gonzalez, from France's Research 
Institute for Development, will be 
charged with guiding the CIRMF's 
157 staff and US$5-million annual 


Ebola virus, carried by bats, is one of the pathogens studied at the CIRMF. 


in terms of establishing a regional 
or international profile," adds one 
senior researcher in tropical 
disease at a leading African 
research centre, who asked not 
to be named. At the same time, he 
says, he understands all too well 
the difficulties and time involved in 
building up a field research centre in 
Africa. “The new director has plenty 
of scope for raising its profile and 
relevance,” he explains. 

Gonzalez took over on 
1 September from Philippe Blot, who 
has been interim head since 2003. 
Blot was brought in with a remit to 


research budget. The centre, based 


clean up after a turbulent period 


in Franceville in southeast Gabon, 
boasts research infrastructure 
that is rare in Africa, including a 
biosafety level 4 lab for working 
with the world's most dangerous 
pathogens, and has long been seen 
as having the potential to make 
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a worldwide impact. It is funded 
largely by the French oil giant 
Total, which helped bring El Hadj 
Omar Bongo to power as Gabon's 
president in 1967. The centre was 
created jointly by the government 
and Total in 1979, as part of a deal 


giving France access to Gabon's 
oilfields. 

But despite its healthy finances, 
the centre has largely been a white 
elephant, some say. “The CIRMF 
is a very well-funded institution 
that has not obviously delivered 
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in the 1990s, when local 
management and expatriate 
researchers clashed repeatedly over 
administrative and other matters’. 
The situation persisted until around 
2003-04, and was not helped by a 
constant staff turnover, including 
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Bettinger says the review ensures that 
science, not politics, is paramount. “The way 
to avoid the inevitable local politics,” he says, 
“is to kick it to a higher level for independent 
review. 

The turbulent history of the bones reflects 
this. The skeletons — of a man and a woman 
buried in alignment, along with the less-well- 
preserved remains of a third individual — were 
stored at the independent San Diego Museum 
of Man. They were also examined at the Smith- 
sonian Institution in Washington DC. Most 
recently they have been at San Diego State Uni- 
versity, where Native American scientists have 
been conducting morphometric analyses. 

Isotopic analysis of the bones suggests that 
the people ate mainly seafood; anthropolo- 
gists say this indicates they were seafarers, not 
inland dwellers like the Kumeyaay. Still, in May, 
the office of UCSD chancellor Marye Anne 
Fox wrote to Yudof’s office outside the normal 
channels, saying that UCSD executives wanted 
his office to facilitate returning the skeletons to 
the tribes to avoid any “cultural insensitivity”. 

“There are many points of view on the 
cultural affiliation of the remains,” says Art 
Ellis, the vice-chancellor for research at UCSD. 
“We wanted to make sure the office of the Uni- 


Bs 


c ifornia’s Kumeyaay \ 
_ tribes are fighting to 
reclaim the skeletons. 


a ae 


versity of California president took into con- 
sideration all points of view when making the 
final decision” 

Rex Dalton 


three director-generals in almost 
as many years, and a corruption 
scandal. 

With Blot's efforts, the worst is 
now over and a culture of nepotism 
has been cleaned up, says Patrice 
Debré, head of the CIRMF's 
scientific board. But with all the 
past focus on administrative 
matters, the centre has not had 
a coherent research strategy, he 
says. “Research fell behind; there 
was no real scientific leadership. 
That's the leadership that 
Gonzalez's appointment will 
now bring.” 

Gonzalez wants the centre 
to focus on emerging infectious 
diseases, which is his own research 
background, and to become an 
international centre for research 
into Ebola virus, arboviruses 
and other novel pathogens. New 
labs set up at the centre over 
the past five years have notched 
up some success in this area, 
notably the discoveries that bats 


are a reservoir of both Ebola” and 
Marburg? viruses. The centre has 
also strengthened its retrovirology 
research, 

“We are now in an ascendant 
phase for research, after a long 
period where that wasn't the 
case," says Gonzalez. He hopes to 
reinstate parasitology and malaria 
research, areas that were lost by 
staff departures. He also wants to 
add a new theme: how ecosystem 
biodiversity affects the emergence 
and transmission of disease. And, 
to complement the centre's existing 
tropical-forest stations, he plans to 
build a field station in the savannah 
in southern Gabon. 

The common thread running 
through this strategy is to refocus 
the centre on its key strengths. Its 
location in an emerging-diseases 
hotspot makes it ideal for studying 
the emergence and transmission of 
diseases from primates to humans. 

Furthermore, the CIRMF's 
primatology centre is among the 


largest in Africa. It contains around 
450 primates, including gorillas, 
mandrills and macaques, half of 
them housed in a forest enclosure. 
“The natural hosts for SIV [simian 
immunodeficiency virus] and STLV 
[simian T-lymphotropic virus] 

and many other viruses are 
African primates,” says Bettina 
Salle, the veterinary surgeon 

who runs the primate centre. 

“We have the ideal animal colonies 
and geographical location for 
research into the viruses in their 
natural hosts.” 

Debré notes that the centre can't 
begin to compete with international 
clinical research in retroviruses 
suchas HIV, in part because it lies 
too far from major hospitals. So 
instead it hopes to capitalize on 
the current back-to-basics shift 
in AIDS research, to position itself 
as a place to study wild retroviruses 
in primates and carry out HIV 
research and vaccine trials in 
primate models. 
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The CIRMF should take 
advantage of the infrastructure 
provided by the government and by 
Total, says Debré, but balance that 
by pursuing outside, independent 
research funding. Gonzalez says 
he hopes to have obtained a50% 
increase in the centre's research 
budget and staff by 2010. “A test of 
Gonzalez will be how successful he 
is here," says Debré. 

One former CIRMF researcher 
praises Gonzalez's track record, 
but reckons the director-general 
faces an uphill battle. Salle is more 
optimistic: logistics and cultural 
challenges mean that working at 
the CIRMF is “not always easy", she 
says, “but we can do great things 
here. Gabon has a great tool for 
science.” 

Declan Butler 


1. Butler, D. Nature 394, 113 (1998). 

2. Leroy, E. M. et al. Nature 438, 575-576 
(2005). 

3. Towner, J.S. etal. PLoS ONE 2, e764 
(2007). 
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German authority halts primate work 


Germany's constitution guarantees its citizens 
the freedom to conduct research — but local 
authorities in the northern city of Bremen are 
forcing a leading neuroscientist to halt his pri- 
mate experiments. A court will probably now 
have to decide whether the controversial rul- 
ing violates federal law. 

Andreas Kreiter at the University of Bremen 
uses 24 macaques to study cognitive processes 
in the mammalian brain. Germany’s largest 
animal-protection group, the Animal Welfare 
Association, has for years campaigned against 
the experiments, claiming that they are intol- 
erably painful and have no short-term thera- 
peutic use. 

Local politicians have become increasingly 
sympathetic to that view. Last year, in a move 
criticized by scientists as a grab for votes, 
Bremen’s parliament called on the state gov- 
ernment to ban Kreiter’s primate research (see 
Nature 446, 955; 2007). After regional elections 
in May 2007, the newly formed Social Demo- 
crat-Green coalition government agreed not 
to reapprove his experiments when his aurrent 
licence expires later this year. 

On 15 October, Kreiter was o fficially 
informed by the senate of health — the local 
authority in charge of approving animal exper- 
iments — that his licence will na be renewed. 
Referring to “changed societal values’, the 
authority argued that the experiments were 
“ethically unjustified” because they address 
long-term scientific questions rather than help 
develop specific medical therapies. 

Kreiter counters that the authority's reason- 
ing is subjective and will not hold legally. “It is a 
purely arbitrary ethical assessment that doesn't 
go into the nature and purpose of my work at 
all” he says. 

The authority’s logic is 
“pretty strange’, agrees Rein- 
hard Fischer, who oversees the 
primate experiments on behalf 
of Wilfried Miller, the univer- 
sity rector. Fischer notes that the 
ruling ignores a positive judge- 
ment rendered last year by an 
expert commission comprising scientists and 
representatives of animal-welfare organizations. 
Bremen’s government had asked the commis- 
sion to evaluate Kreiter’s work, and it voted in 
favour of continuing the experiments — deem- 
ing them scientifically important and in full 
accordance with animal-protection laws. 

Matthias Kleiner, the president of Germany’s 
main research-funding agency, the DFG, 
and many neuroscientists have criticized the 


"This is a flagrant case 
of political interference 
with approval 
procedures, and with 
the general freedom to 
do science.” 


Andreas Kreiter plans to appeal the controversial decision to stop his studies of cognition in macaques. 


decision. “We're all very concerned about what's 
going on in Bremen,’ says Stefan Treue, direc- 
tor of the German Primate Center in Gottingen 
and coordinator of EUPRIM-Net, a European 
Union-funded network of eight large primate 
centres. “I just can’t see why what’s perfectly 
fine in one place should be 
unethical in another. To me, 
this is a fagrant case of politi- 
cal interference with approval 
procedures, and with the gen- 
eral freedom to do science” 
Nine laboratories across 
Germany conduct primate 
experiments; in recent years 
the number of animals involved has ranged 
between 1,300 and 1,800. About 150 of those 
are for basic research; the rest are used for other 
experiments such as drug toxicity tests. 
Kreiter’s experiments record the activity 
of single neurons in the brains of macaques 
while they complete particular behavioural 
tasks. This cannot be usefully done on ani- 
mals with less-developed brains, such as mice, 
or with available non-invasive brain-imaging 
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methods, says Kevan Martin, a neuroscientist 
at the Swiss Federal Institute of Technology, 
Zurich, whose own research on macaques is 
also threatened by a ban (see Nature 453, 833; 
2008). “By taking away monkey experiments 
you certainly greatly diminish the potential 
benefits of fundamental research like Kreiter’s 
for future clinical applications,’ says Martin. 

The ruling jeopardizes the existence of 
Kreiter’s centre, including the work of a dozen 
or so postdoctoral researchers and PhD 
students. Also at risk is his group’s partici- 
pation in a €2.3-million (US$2.9-million) 
project, funded by the German science ministry 
and aimed at developing key technologies for 
neuroprostheses. 

Kreiter’s current licence to conduct experi- 
ments runs out on 30 November. To be able 
to continue his research beyond that date he 
will appeal the decision and seek an interim 
injunction at court. 

University officials say they will support him 
and, if necessary, take the case to Germany’s 
Federal Constitutional Court. a 
Quirin Schiermeier 
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Salmon study sparks row over dams 


Research that argues dams have no direct effect 
on the migration of juvenile salmon is roiling 
waters in the US Pacific Northwest. 

The study, published this week in PLoS 
Biology, uses a new way to tag fish to compare 
the heavily dammed Columbia River system 
with the free-flowing Fraser River to the north 
in British Columbia, Canada. After hatch- 
ing upriver, salmon smolts fared equally well 
on their journeys down the two rivers to the 
ocean. But that’s where the agreement ends. 

Salmon migration remains at the heart of an 
ongoing court battle, which pits environmental- 
ists who want to see some of the dams removed 
against federal agencies charged with maintain- 
ing the dams, providing power and protect- 
ing salmon. Thirty-one federal dams on the 
Columbia River and its tributaries collectively 
provide some 60% of the region’s electricity. 

The Bonneville Power Administration, based 
in Portland, Oregon, has spent billions of dollars 
making the dams more friendly for fish, and the 
latest plan, issued on 5 May, proposes further 
modifications to dam operations. Taken at face 
value, the new research might suggest that the 
investment is paying off and that other factors 
are suppressing the 13 salmon populations pro- 
tected under the Endangered Species Act. 

But environmentalists and several salmon 
biologists pounced on the study, suggest- 
ing that industry funding might have biased 
the results. These critics question the value 
of comparing the two rivers and say that the 
study doesn’t even address what many think is 
the dams’ biggest effect: stressed smolts dying 
after they reach the ocean. 

Still, the study explicitly adcnowledges these 
limitations. Lead author David Welch, founder 
of Kintama Research in Nanaimo, British 
Columbia, and an adjunct pro- 
fessor at the University of Victo- 
ria, has received nearly US$4.2 
million from Bonneville Power 
for his work, although his larg- 
est funder is the Gordon and 
Betty Moore Foundation. How- 
ever, he says that his funders 
had no role in vetting the study. “Everybody 
assumed that we would simply see much higher 
mortality in the dammed system — we didn't 
find that,’ he says. “We don't want to oversell it 
and say it definitively answers these questions, 
but it’s pretty damn surprising” 

But two of Welch’s co-authors, Shaun 
Clements and Carl Schreck, who did their 
work at Oregon State University in Corvallis, 
are concerned that the study is being oversold. 
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“Everybody assumed 
that we would simply 
see much higher 
mortality in the 
dammed system." 


Free-flowing water might not aid chinook salmon as much as some believe. 


Clements says that a press release issued by PLoS 
Biology with the headline “Dams make no damn 
difference to salmon survival” goes too far. “This 
study doesn't take dams off the hook,” he says. It 
may point to problems in the Fraser rather than 
clearing dams on the Columbia, he adds. 


Sound trackers 

Tagging studies typically rely on passive sen- 
sors implanted in the fish, but the sensors 
have a short range and can be counted only 
as the salmon are funnelled past receivers at 
the dams. In contrast, Welch’s company uses 
powered transmitters that send acoustic sig- 
nals to receivers, allowing the 
researchers to track the salmon 
in free-flowing rivers and along 
the coast. 

The team tagged hatchery 
spring chinook and wild steel- 
head on the Fraser River from 
2004-06 and then compared 
those results to other tagging work done in 
the Columbia. In 2006, the team also tagged 
hatchery spring chinook fish on the Colum- 
bia’s Snake River tributary and tracked fish 
as they navigated eight major dams to the 
ocean. The numbers of surviving fish var- 
ied widely, Welch says, but were statistically 
indistinguishable between the river systems 
(D. W. Welch et al. PloS Biol. 6, e265; 2008). 

Michele DeHart, manager of the Fish Passage 
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Center in Portland, which tracks salmon 
statistics, says the comparison is “seriously mis- 
leading”. She points out that the Fraser River has 
its own problems, including an infestation of 
bark beetles, which could affect water quality, 
and that federal dam operators have been under 
a court order since 2005 help juvenile salmon 
by spilling water over the dams during spring 
and summer migrations. 

Ed Bowles, the top fish biologist for the 
Oregon Department of Fish and Wildlife, says 
the results merely underscore the notion that 
delayed mortality is the real killer. He points 
to work showing that the number of salmon 
returning upriver to their original spawning 
grounds — years after their initial migration 
to the ocean — is several times higher for fish 
that navigate fewer dams in the lower reaches 
than for populations that must traverse eight 
dams to reach the sea (H. A. Schaller and 
C. E. Petrosky North Am. J. Fish. Mgmt 27, 
810-824; 2007). 

The additional energy and stress required 
to move through the dammed system seems 
to inhibit survival in the ocean, he says. “This 
paper doesn't really do anything to dispel that 
working hypothesis.” 

Welch says he is now looking into the issue of 
delayed mortality by tracking the same salmon 
populations after they enter the ocean and head 
up the coast to Alaska. a 
Jeff Tollefson 
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European shortage will 
continue into 2009. 
www.nature.com/news 


Older scientists publish more papers 


If the mind is the first to go, it doesn’t seem to 
show up in the bibliography. A new study sug- 
gests that older scientists publish more than 
their younger counterparts. 

A research team from the University of Que- 
bec in Montreal, Canada, looked at the publi- 
cation record of nearly 14,000 professors and 
found that those in their 50s and 60s published 
almost twice as many papers each year as those 
in their early 30s. What’s more, the papers 
are just as highly cited, accord- 
ing to the work, which appears 
on the arXiv preprint server 
(Y. Gingras et al. http://arXiv.org/ 
abs/0810.4292; 2008). 

The survey, say its authors, is the strongest 
evidence yet that older scientists are at least 
as productive as their juniors, if not more 
productive. “We should not underestimate 
the older researcher,’ says team leader Yves 
Gingras, a historian of science. 

Gingras became interested in the topic after 


reading that the average age for first grants 
from the US National Institutes of Health in 
Bethesda, Maryland, rose from 34 in 1970 to 
nearly 42 in 2004. Hs team looked at publica- 
tion data for 13,680 scientists and scholars in 
Quebec between 2000 and 2007, noted each 
researcher's age, the number of papers pub- 
lished and the citation record according to the 

Thomson Reuters citation index. 
The findings held across both the sciences 
and the humanities. Unsur- 


“It's better to keep prisingly, the team found that 
researchers and 


let them work.” 


productivity increased sharply 
throughout the late 20s and early 
30s. But it continued to rise at a 
slower rate for researchers in their 40s and 50s. 
Active professors in their 60s continued to pro- 
duce about three papers per year. The average 
impact factor of each paper fell during the rise 
in productivity, but turned upward again for 
papers published by older authors. Citations, 
meanwhile, remained stable, suggesting that 


older researchers were both more prolific and 
more highly cited than their juniors. 

The survey is one of the largest of its kind, 
says Wolfgang Glanzel, a director at the Policy 
Research Centre for R&D Indicators at the 
Dutch-speaking Catholic University in Leuven, 
Belgium. He would like to know how many 
of the papers published by older authors are 
review articles, and what effect students had 
on their older co-authors’ output. Both could 
be important factors in the apparent high pro- 
ductivity of senior scientists, he says. 

“High impact is not the same as originality,” 
adds Anthony van Raan, who studies science 
and technology policy at Leiden University 
in the Netherlands. Still, he says, the research 
makes a compelling case against mandatory 
retirement, which is commonplace through- 
out Europe. “It’s better to keep [researchers] 
and let them work,’ he says. “They can be very 
productive” = 
Geoff Brumfiel 
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Rare greenhouse gas adds 
to climate concerns 


Scientists are recommending that a rare 
but potent greenhouse gas should be 
included in future climate agreements after 
confirming that the gas is about four times 
more abundant than previously believed. 

Nitrogen trifluoride (NF;) is commonly 
used to etch microcircuits in plasma-screen 
televisions and other flat-panel displays. As 
a greenhouse gas, NF, is about 17,000 times 
more efficient at trapping heat than carbon 
dioxide, but very little was thought to escape 
into the atmosphere. 

A team led by Ray Weiss of the Scripps 
Institution of Oceanography in La Jolla, 
California, confirmed that atmospheric 
concentrations have risen more than 20-fold 
during the past 30 years after analysing air 
samples from coastal stations in California 
and in Tasmania, Australia. 

The scientists reported that in 2008 some 
5,400 tonnes of NF, were present in the 
atmosphere (R. E. Weiss et al. Geophys. Res. 
Lett. doi:10.1029/2008GL0359 13; 2008). 
For a longer version of this story, see 
http://tinyurl.com/6a6gta 


Rice pathogen is added to 
list of bioterror agents 


The US Department of Agriculture 

(USDA) this month added a rice pathogen 

to its national security watch list, despite 
objections from several prominent scientists. 

USDA’ Animal and Plant Health 
Inspection Service has listed the bacterium 
Xanthomonas oryzae pv. oryzae, which 
causes leaf blight, as a ‘select agent’ that 
could be used for bioterrorism. 

Several researchers who work with the 
microbe argued against the listing, saying 
that it would have difficulty establishing 
itself in the United States, and that those 
labs working with it have perfect safety 
records dating back more than 20 years. 

Researchers have until 17 November to 
notify the government if they possess the 
pathogen, and until 14 April 2009 to come 
into full compliance with the regulations. 

“That’s not a lot of notice,” says Pamela 


Blight in rice caused by Xanthomonas oryzae. 


India makes history with launch of Moon mission 


India’s first lunar spacecraft is set to 
swing into orbit round the Moon on 
8 November. 

The Indian Space Research 
Organisation (ISRO) successfully 
launched Chandrayaan-l (pictured) on 22 
October from the Satish Dhawan Space 
Centre in Sriharikota, southern India. 

The craft is carrying six foreign and five 
Indian scientific instruments designed to 
create a three-dimensional map of the 
Moon and study the lunar soil, among 
other things. The spacecraft will also 
search for frozen water and measure the 
abundance of helium-3. 

The ISRO is now planning 
Chandrayaan-2, which would take a 
Russian-built rover to the Moon's surface 
by 2012. The agency also aims to send 
two Indians into space by 2015, ina 
mission that will cost roughly 120 billion 
rupees (US$2.4 billion). 


Ronald, a rice researcher at University of 
California, Davis, who opposed the listing. 
“All the labs are scrambling to find out what 
it means to their research programmes.” 


Fall in profits prompts 
major job cuts at Merck 


Pharmaceutical giant Merck is to cut 7,200 
jobs, or roughly 12% of its workforce. 
Around 40% of the cuts will be in the United 
States, many of them from management 
as Merck trims the number of senior and 
mid-level executives by 25%. The company, 
based in New Jersey, also plans to close 
basic-research sites in Tsukuba (Japan), 
Pomezia (Italy) and Seattle (Washington). 

These cuts come on top of the axing of 
10,400 positions announced in 2005, which 
Merck says was “substantially complete” as 
of last month. 

Merck's third-quarter profits were 
down 28%, and spending on research and 
development (R&D) dropped by 19% 
compared with 2007, to $1.2 billion. But the 
company says that R&D spending actually 
rose by 2% if expenses from restructuring 
efforts and the 2007 acquisition of 
NovaCardia are taken into account. 


Swedish basic research 
receives funding boost 


The Swedish government has announced 
a record investment in basic research, 
much of which will be directed to fields 
such as stem cells, climate modelling and 
nanotechnology. 

Introduced on 23 October, the bill 
provides almost 15 billion Swedish krona 
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(US$1.9 billion) for research and innovation 
between 2009 and 2012. Sweden currently 
spends 25 billion krona a year on research; 
that figure is expected to rise by 20% by 2012. 

Most of the new funds will be distributed 
to universities through a merit-based system. 
Gunnar Oquist, head of the Royal Swedish 
Academy of Sciences in Stockholm, says that 
although the initiative would reverse years of 
declining research budgets, he worries that 
its narrow focus could stifle creativity. 

The bill will now be taken up by the 
parliament, and is expected to pass early 
in 2009. 


NASA clamps down on 
conference attendance 


NASA has put a moratorium on 
participation in conferences in an effort to 
comply with an authorization bill recently 
enacted by Congress. 

Signed into law by President Bush on 15 
October, the act limits conference spending 
to US$5 million in the 2009 fiscal year. That 
figure represents more than a two-thirds 
reduction from conference spending in 2008. 

The moratorium, announced earlier this 
month, covers sponsorship of conferences, 
as well as travel and admission fees. It 
applies to NASA staff and some contractors, 
such as employees at the Jet Propulsion 
Laboratory in Pasadena, California. 
Scientists receiving NASA grants or 
sponsorship have not been affected so far. 


Correction 

The News Feature ‘What lies beneath’ (Nature 
455, 724-725; 2008) incorrectly stated that 
nitrogen-fixing bacteria are 1-2 millimetres in 
length. They are, in fact, 1-2 micrometres long. 
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Where should the drug industry go to find new ideas? In the first of two features, Alison Abbott 
asks if the future lies in systems biology — a field that attempts to piece together ‘everything’. 
In the second, David Cyranoski looks at drug companies’ attraction to China. 


A FIENDISH PUZZLE 


fit is hard to make something from 

nothing, then it can be just as hard to make 

something from everything. But that, in 

essence, is what many pharmaceutical 
companies are trying to do as they seek new 
drug targets by integrating the massive sprawl 
of biological information now available. 

The puzzle is like a jigsaw with an unknown 
number of pieces and, as yet, no edges. So 
when Cristiano Migliorini of the drug giant 
Roche saw a promising drug target for type 2 
diabetes emerge from hundreds of millions of 
data points, he felt a celebration was in order. 
He picked up the phone and invited his 20 or 
so collaborators from research institutions all 
over Switzerland to a party. 

The celebration — a good dinner, drinks 
and some lively pub games — might have 
been considered premature. It is a long and 
tortuous route from an interesting protein to 
a new and effective drug. But since the party, 
this protein has been put through its paces at 
Roche's drug-development labs in Basel, and 
candidate drugs that bind to it are already 
lined up for testing. The target could still 
fail, but in Migliorini’s mind, the results have 
already shown that the mega-data-crunching 
approach of ‘systems biology’ — new, modish 
and increasingly adopted by pharmaceuti- 
cal firms — can piece together a meaningful 
picture from this colossal biological puzzle. 


“And that,’ says Migliorini, “is worth at least a 
good meal and a game of pool”. 

The crisis-ridden pharmaceutical industry 
desperately wants something to celebrate. 
The glory days of the blockbuster drug seem 
to be over. When genomics matured at the 
turn of the century, much of the industry was 
convinced that individual genes would emerge 
as the new drug targets. But that reductionist 
bubble soon burst: the more that geneticists 
and molecular biologists have discovered, the 
more complicated most diseases have become. 
As individual genes have fallen out of favour, 
‘systems’ — multitudes of genes, proteins and 
other molecules interacting in an almost infi- 
nite number of ways — have come into vogue. 
Systems biology is an attempt to make sense of 
all these data. 

Some researchers and analysts were cynical 
when systems biology was hyped as the sav- 
iour of the failing research and development 
pipelines. “Rightly so, perhaps,” says Giulio 
Superti-Furga, who studies complex protein 
interactions and is head of the Research Center 
for Molecular Medicine in Vienna. “A few 
years ago, systems-biology proponents were 
writing reviews that triggered unreasonable 
expectations for advances in medicine” Many 
companies have therefore been cautious about 
making a big investment. Like Roche, most are 
testing the water with defined projects, usually 


Cristiano Migliorini has seen systems biology deliver promising drug targets. 
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in collaboration with top academics or small 
computational companies. 

Although systems biology is far from 
proven as a drug-discovery tool, it has many 
advocates and its wider adoption seems to be 
inevitable. If leading biologists are concluding 
that disease can only be understood by tackling 
the entire, messed-up system, pharmaceuti- 
cal companies arguably have little choice but 
to keep up. “Industry really needs to dive in; 
Superti-Furga says. 


Never-ending networks 

At its heart, systems biology is about gathering 
unprecedented amounts of data from cells, 
then making sense of it through mathemati- 
cal models. At its most sophisticated, it might 
involve the high-throughput collection of 
molecular data, such as DNA sequences, 
RNA molecules, proteins and metabolites as 
well as more descriptive data such as clinical 
diagnoses and drug responses. These data are 
then assimilated into computational models of 
cellular processes which, as molecules change 
location and function every microsecond, 
must also accommodate the dimensions of 
time and space. 

It is not enough to mimic what’s known 
about the cell — such models must also pre- 
dict what is unknown so that scientists can test 
their hypotheses. A model of a liver cell, for 
example, might reveal which of 100 interest- 
ing drug candidates will prove beneficial and 
which will be toxic. Knowing when to pull a 
doomed drug from the pipeline could spare 
a company a lot of expensive work. “The big- 
gest value of systems biology immediately is 
as an aid to decision-making,” says David de 
Graaf, who has worked in systems biology 
programmes at Pfizer, AstraZeneca and now at 
Boehringer Ingelheim. “And a drug company 
is a decision-making machine.” 

Although systems biologists want it all, for 
now they cannot get it. “We'll not have every- 
thing a hundred years from now,’ says Colin 
Hill, chief executive of the biotech company 
Gene Network Sciences in Cambridge, Mas- 
sachusetts, which uses supercomputers to 
model complex biology for drug companies. 
“Even after two decades of gathering and 
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archiving molecular data, barely 5% of 
the total circuitry inside the cell is known.” So 
systems biologists must choose between two 
approaches when developing their models. 
In one, they scale back their ambition, paring 
down the number of biochemical pathways 
being modelled to just a few well-known ones. 
In the second, they bluff the bits they do not 
know, trying to model ‘everything; and using 
sophisticated computational tricks to compen- 
sate for absent data. 

At the end of the day, the choice of approach 
depends on what question researchers want to 
ask of their models. It’s a bit like the conun- 
drum in Douglas Adams's Hitchhiker's Guide to 
the Galaxy, where the computer Deep Thought 
spends 7.5 million years computing an answer 
to the most horribly imprecise “ultimate ques- 
tion of life, the Universe and everything” — 
only to come up with the answer “42”. If the 
question is narrow — ‘which surface proteins 
are affected when insulin activates receptors on 
the pancreas cell?’ — then it may be sufficient 
to model only those networks known to be 
involved in insulin signalling. If the question 
is broad — ‘what would happen to an animal 
fed drug X? — then the everything’ approach 
is more appropriate to see what falls out, be it 
stronger muscles or liver failure. 


The meaning of life 

Drug companies are divided on which of 
the two approaches is better, and Migl- 
iorini is clear about which he prefers. 
“We take modelling in small 
doses — a defined number of 
well-understood pathways,” he 
says. “We have a prag- 
matic approach — we 
cart say “42’ to man- 
agement.” Cell biologist 
Willy Krek, an academic 
partner of Miglorini’s at one of 
Switzerland’s foremost universi- 
ties, the ETH in Zurich, adds: “It is 
strategically better to get a rich data set ona 
limited network than to take measurements 
for the rest of your life, always saying they are 
still not enough” 

Even the scaled-back approach to systems 
biology takes massive effort. Migliorini’s col- 
laboration began in 2006 as part of Switzer- 
land’s big systems-biology research programme 
now known as SystemsX.ch. Migliorini calls it 
“an industry-academic collaboration beyond 
cliché’, referring to what is in fact the largest 
such partnership between pharmaceutical com- 
panies and academia in biology that Switzer- 
land has ever seen. It involves several top Swiss 
research groups, plus academic and industrial 
partners elsewhere, at a cost of 9 million Swiss 
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“we'll not have 
everything 4 
hundred years 


mn now.” 
c — Colin Hill 


Colin Hill uses his background in theoretical 
physics to model complex biological systems. 


francs (US$8 million) over three years as well 
as undisclosed internal spending. 

In searching for its diabetes drug, the 
consortium has built a model focused on the 
500 or so proteins — called the ‘surface pro- 
teome’ — on the coat of pancreatic f cells, the 
cells that secrete insulin when blood glucose 
concentrations rise. Migliorini and his co- 
workers are working on the concept that some 
people who overeat develop type 2 diabetes 
because their f cells cannot keep up with the 
body’s demands. The team is seeking to identify 
surface proteins that stop responding to envi- 
ronmental signals such as glucose metabolites, 
as these may be responsible for transforming 
the overworked cells into diseased cells. 

The scientists use almost every high-tech 
method imaginable to gather data on pro- 
teomes and the cellular networks they link 
into. Consortium member Rudi 
Aebersold from the ETH devel- 
oped one such method using 
mass spectrometry that 
separates out glycopro- 
teins — the proteins 
involved in key proc- 
esses such as protein fold- 
ing or cell-cell signallin g 
— from the background of other 
proteins’. Importantly, the tech- 
nique quantifies as well as identifies pro- 
teins, providing a mdecular fingerprint 
for acell in a paticular stage of health or 

disease. Quantification is essential to sys- 
tems biology, whereas in the past, molecular 
biologists were happy just to see whether or not 
a protein was present. 

To flesh out the proteomic information, the 
researchers also generate reams of other data, 
either experimentally or by pulling them out 
from public and commercial databases, to feed 
into their computational model. They then work 
out which surface proteins converge on the same 
intracellular pathways. At the end, they have a 
massive, moving, knitting-ball of a network in 
which every protein is linked to every other 
protein it interacts with. The hope is that this 
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can point to the ‘nodes’ — the key points in a 
network that have disproportionate influence. 
Then researchers can ask questions of the 
model. For example, if the team suspects that 
the disease is being caused by a particular aber- 
rant glycoprotein, they strip that protein from 
the model. If correct, the model should then 
mimic the knock-on effects in the circuitry that 
are known to occur. That glycoprotein then 
becomes a potential drug target or, perhaps, a 
biological marker of disease progression. 


Quick results 
At the start, not everyone at Roche expected 
the systems approach to succeed within 
the three years the collaboration was ini- 
tially given. In fact, Migliorini’s first hit, last 
autumn, came within a year of his project 
starting. “It was a great feeling to see our 
engineering-style approach — multiple 
inputs, each comprising amounts of data 
that make you sweat — spit out a clear sin- 
gle target,” he says. Here, luck intervened: 
the protein in question (its identity is still 
guarded) had been investigated in relation 
to an entirely unrelated disease, so the tools 
and compounds for studying it were already 
available. “The whole process went lightning- 
fast,” says René Imhof, director of research 
for Roche in Basel. Roche’s academic partners 
are also happy, because the collaboration was 
formed at exactly the time they were starting 
to feel the limitations of the traditional genes- 
will-explain-disease reductionism. 

Across the Atlantic, in the Boston area of 
Massachusetts, Peter Sorger and Douglas 
Lauffenburger have been feeling similar con- 
straints. Sorger says that he used to regularly 
raise hackles with his outspoken criticism 
of traditional genomics. As the region is 
home to the Whitehead Institute for Bio- 
medical Research, one of the most impor- 
tant sequencing hubs of the Human Genome 
Project, those hackles were being raised on 
some particularly high-level necks. 

Sorger and Lauffenburger now operate com- 
plementary systems-biology labs — Sorger 
at Harvard Medical School in Boston and 
Lauffenburger at the Massachusetts Institute 
of Technology in Cambridge — and have been 
asked to collaborate with numerous pharma- 
ceutical companies, usually in the area of 
cancer. Both have worked with AstraZeneca, 
for instance, to understand the action of the 
company’s anticancer drug gefitinib (Iressa). 
Gefitinib was a much-heralded drug designed 
to hit the ERBB1 receptor when it is mutated, 
as it frequently is in various cancers. But 
few patients responded to the 
drug, and genetic sequencing 
showed that mutations in the 


receptor could not predict who these patients 
would be. “By focusing down on just the gene, 
the genomic approach had taken the target out 
of its physiological context,” says Sorger, “and 
the physiology needed to be put back with sys- 
tems biology.” 

Lauffenburger attempted to do this by 
looking at the whole life cycle of the errant 
receptor and how it connected with bio- 
chemical pathways inside the cell. Like many 
membrane receptors, ERBB1 switches off its 
signalling capacity by removing 
itself from the surface of the cell, 
and Lauffenburger theorized that 
sensitivity to gefitinib 
could depend on how 
well this process works. 
His team collected data 
on genes, transcription, 
protein phosphorylation and 
the internalization of the recep- 
tor from cells that were either sen- 
sitive or insensitive to gefitinib and built 
a model to mimic the circuitry associated 
with the receptor. 

The model suggested that mutations in 
ERBB1 were almost irrelevant: gefitinib would 
work only in cells that had inefficient inter- 
nalization of ERBBI (tef. 2). So far this has not 
generated new biomarkers for the company’s 
clinical trials, but it has shown that compli- 
cated systems biology may be needed to select 
patients for drug treatments. 

Like Migliorini and many others in systems 
biology, Lauffenburger has an engineering 
background. Others, such as Hill, have swept 
in from theoretical physics. Hill turned to biol- 
ogy after a stint at the interdisciplinary Santa 
Fe Institute in New Mexico, where he realized 
that physicists like himself had become com- 
fortable contemplating the vastness of the Uni- 
verse or the minuscule nature of the subatomic 
world, but were distinctly uncomfortable with 
a mid-sized affair such as life. “Life was always 
too complicated a subject for us,’ he recalls. 

Hill’s type have brought with them the 
sort of Bayesian, or probabilistic, analyses 
that have been applied in other areas that 
have huge and incomplete data sets, such 
as astronomy. They use these techniques to 
tackle the second approach to systems biol- 
ogy, relying on computer models to fill in the 
gaps between relatively sparse data points. 
Researchers then put real data into the model 
and modify it to fit the data better, repeating 
these adjustments until all possible permuta- 
tions have been explored. 

This type of modelling requires a scale of 
computing effort analogous to that 
required to predict weather and 
understand global warm- 


“P hysiology 
needed to be 
put back with 
Systems biology,” 
— Peter Sorger 


PHARMACEUTICAL FUTURES NEWS FEATURE 


ing. The number of ways in which one gene 
or protein can link to another in the tangled 
web of intracellular pathways and networks is 
astronomical. “If you take 22 variables, such 
as gene, transcript or clinical measurements, 
they may each affect an average of maybe 250 
points in anetwork — that means there could 
be 250” possible combinations,” says Hill. “We 
can only do this with supercomputing.” Hill’s 
company is now cranking out potential 
drug targets, which it patents and then 
peddles to pharmaceutical companies. 
Industry giant Merck is known 
for its bold move to incorpo- 
rate the ‘everything’ approach 
into systems biology into 
its drug-development 
programmes. In 2001, 
it acquired the Seattle- 
based genomics com- 
pany Rosetta Inpharmatics to 
develop systems genomics in 
house. “The scale of the biologi- 
cal information we have today is unprec- 
edented,” says Eric Schadt, Rosetta’s science 
director and a former physicist. “Fifty thou- 
sand or so transcripts, millions of mutations, 
hundreds of clinical endpoints — a high-per- 
forming computing environment is essential 
and we have one of the world’s biggest.” 


Growth factors 

Earlier this year, the group used the model to 
show that its systems approach had identified 
key disease-causing gene networks in meta- 
bolic diseases’. The paper was published back- 
to-back with another that had used a systems 
approach to identify three obesity-related 
genes that had not sprung out of genomics 
data alone’. “The surprise here was to find 
that not just a handful of genes were causally 
implicated in the disease, but hundreds — 
whole pathways were involved,” says Schadt. 
“So the challenge is to hit the nodes in net- 
works where implicated pathways overlap.” 
The company says that up to 30% of targets for 


Systems biologist Peter Sorger has criticized 
genomics techniques. 
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various diseases in its early pipeline have been 
identified through Schadt's techniques, and 
that a few are in early-phase clinical trials. 

But some people in ind ustrial and 
academic circles remain hesitant about 
systems biology. No one can be sure that it 
will really increase the number of targets or 
biomarkers that make it through clinical test- 
ing. Still, it is a gamble that almost all campa- 
nies seem willing to make, even if investment 
levels are sometimes small. In a report pub- 
lished earlier this year, industry analysts at 
PriceWaterhouseCoopers argue that the 
pharmaceutical industry needs to rely much 
more on systems biology if it is to survive the 
failing-pipeline crisis and predicts that the 
approach will have become more prevalent 
by 2020. 

Stephen Friend, Merck’s vice-president for 
oncology, thinks that any hesitancy will be 
overcome when the modelling becomes so 
predictive that the toxicity and efficacy of a 
potential drug can be forecast very accurately 
even before an experimental animal is brought 
out if its cage. “The next three to five years will 
provide a couple such landmark predictions 
and wake everyone up,” he says. 

The challenge for systems biologists is 
that the system is being stretched at all ends 
of the imagination. There will inevitably be 
new categories of biological information to 
collect and feed into models. Measurements 
will be made on finer and finer scales so that 
individual molecules can be identified and 
counted in single cells with greater speed and 
efficiency. This information will be integrated 
into grander models, although hopefully not 
ones that take 7.5 million years to compute 
answers. At a meeting in Japan in February, 
scientists signed up to a dedaration calling for 
a grand challenge to create a virtual represen- 
tation of the physiology of the entire human 
within three decades. To get that far that fast 
will require formidable advances in technol- 
ogy development, and even greater computa- 
tional power and confidence. 

But for the time being, most systems 
biologists are happy to have their questions 
answered, or at least considered, by drawing 
boundaries around ‘everything’ such that 
it is contained within the walls of a virtual 
cell. “In fact, I don’t think we have a choice,” 
Migliorini says. Life, the Universe and almost 
everything is about the most that biologists 
can handle for now. : 
Alison Abbott is Nature's senior European 
correspondent. 
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rom the outside, China may not seem an 
obvious place to look for a pharmaceu- 
tical breakthrough. Last year, the head 
of the State Food and Drug Agency was 
executed for taking bribes. This year, more 
than 80 people died in the United States alone 
after taking a contaminated 
version of the blood-thinner 
heparin, the main ingredient for 
which had been imported from 
China. The number of domesti- 


“You can consider 
what we are doing in 
China an experiment. 


companies recognize that a research branch in 
China helps build connections with regulatory 
agencies and gain access to a market that some 
analysts predict will be the world’s second larg- 
est after the United States by 2020. 

The combination of desperation outside 
China and promise within has 
convinced almost every big 
pharmaceutical player, includ- 
ing Roche, Novartis, Glaxo- 
SmithKline, Eli Lilly and Pfizer, 


Ly 
cally discovered drugs that the We don't deci to own to collectively invest hundreds 
Chinese pharmaceutical sector everything. of millions of US dollars into 
has turned into internationally — Tony Zhang research operations there over 


successful products? Zero. 

But these are desperate times. Pharmaceuti- 
cal companies around the world face shrinking 
incomes as their leading drugs go off-patent 
and generics come online. The cost and com- 
plexity of new drug discovery and clinical 
testing is soaring and drug firms are taking a 
machete to their research and development 
(R&D) programmes. 

Looked at in this light, certain features 
of China begin to look attractive. A spate of 
returnees, often with experience from West- 
ern pharmaceutical companies, has put talent 
on the ground. A vibrant contract research 
organization (CRO) industry — with more 
than 400 companies in Shanghai alone — 
allows for speedy changes of direction with 
minimal investment. Clinical trials can cost 
half or even one-fifth of what they would in 
the United States. The government is show- 
ing that it is serious about meeting interna- 
tional standards for drug development with a 
6-billion renminbi (US$900 million) invest- 
ment aimed at good practices in laboratory and 
clinical practice (see Nature 455, 142; 2008). 
And, perhaps most importantly, multinational 


1168 


the past two to three years. The 
companies are somewhat cagey about how 
much they are investing at a time when they are 
laying off employees elsewhere, and when there 
is no guarantee of a return. But Kenneth Chien, 


Jingwu Zang says his firm was attracted to China 
by the potential workforce and drug market. 
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an expert in cardiovascular medicine and an 
adviser to several large drug and biotechnology 
firms working in China, calls t “Basel on ster- 
oids’, referring to the throng of pharmaceutical 
companies in Switzerland. 

Expansion into China is 2 experiment, allow- 
ing companies to try out new models for seek- 
ing drugs, often relying heavily on outsourcing 
to CROs. “Every one is doing different things. 
Eventually we'll figure it out,” says Li Chen, head 
of the Roche R&D centre in Shanghai. But there 
is no proven leadership, points out Mingwei 
Wang, director of China's National Center for 
Drug Screening and a pharmacologist at the 
Shanghai Institute of Materia Medica, the drug- 
research institute of the Shanghai Institutes for 
Biological Sciences. “Pharmaceutical companies 
in China have all the problems of the United 
States plus the lack of experience. There are no 
success stories, nobody has built an Amgen or 
Genentech in China” 


Hi-tech high-rises 

Most of Shanghai’s pharmaceutical boom is 
happening in the Zhangjiang Hi-Tech Park in 
Pudong, east of the Huangpu river. It is in the 
‘new’ part of Shanghai, symbolized by the Ori- 
ental Pearl Tower, with wide streets, spacious 
new residential complexes and golf courses. 
Zhangjiang boasts companies selling almost 
every service a pharmaceutical or biotech 
company could want, including the produc- 
tion of active ingredients, genomics, analytical 
and combinatorial chemistry, and preclinical 
toxicology testing. It is a natural place for a 
drug company to want to set up in. 

But not GlaxoSmithKline (GSK). GSK plans 
to go to Puxi, the area west of the Huangpu, 
to build a massive permanent home for its 
research branch, launched in May 2007. Puxi 
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GSK 


has shikumen st one-gate 
dwellings and fashionable 
restaurants, bars, and cafes 
tucked into alleyways. Puxi 

is also home to Shanghai’s, 
and probably China’s, 
most prized biological 
sciences research institute, 
the Shanghai Institutes 
for Biological Sciences, in 
addition to strong research 
hospitals, such as the Ruijin 
and Zhongshan hospitals, and 
the medical schools of Fudan Uni- 
versity and Jiao Tong University. “We 
were looking for a scientific environment,’ 
says Jingwu Zang, who heads GSK’s new out- 
post, called R&D China. “Zhangjiang is for 
services. We want hard-core science.” 

R&D China’s motto is “Discovered in 
China, and it is emblazoned over an image 
of the Oriental Pearl Tower on the wall of the 
company’s temporary office in Zhangjiang. 
Zang hopes that the phrase will replace the 
ubiquitous ‘made in China that brands the 
mostly low-margin goods made there. The 
unit will be the global centre for GSK’s new 
neurodegenerative and neuroinflammatory 
drug-development programmes, focusing 
on multiple sclerosis, Parkinson’s disease and 
Alzheimer’s disease. The plan is to employ 
1,000 people over the next 10 years. The 
company declined to give the full cost of the 
enterprise, but such a unit would be three to 
four times larger than those of other drug 
companies investing in the country. 

GSK’s decision to build such a large and 
permanent facility, rather than rent space as 
most companies do, suggests great confidence 
in what can be accomplished in China. It is 
particularly striking considering that GSK 
announced plans last autumn to lay off 5,000 
people in other branches. Zang says that the 
company was drawn to China by several fac- 
tors, and that cost-saving is not one of them. 
One attraction is the growing talent pool of sci- 
entists who trained overseas and have already 
returned or are willing to return. Almost half 
of R&D Chinas scientists are people who have 
returned from working in the United States or 
Europe, says Zang. Researchers are being lured 
back by government-supported research insti- 
tutes, university positions and the opportu- 
nity to start their own government-supported 
businesses (see Started in Chind). He says the 
company also has in mind he benefits of being 
able to transition drugs to the growing Chinese 
market more easily. 

Zang rates the “overall interactive scientific 
environment” in Puxi as canparable to that in 
the Boston area of Massachusetts and hopes 
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to thrive in it. Over the past decade, GSK has 
collaborated with the Shanghai Institute of 
Materia Medica and it plans to do mare with the 
Shanghai Institutes for Biological Sciences and 
the Shanghai Institute of Organic Chemistry. 

China also offers opportunities to try an Asian 
angle on some intractable research problems. 
For example, Zang plans to investigate how 
multiple sclerosis differs between Asian and 
white populations, drawing on patients from 
Ruijin, Zhongshan and other hospitals. Until 
now, most researchers studying multiple sclero- 
sis, such as those in the Consortium of Multiple 
Sclerosis Centers, of which GSK 
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Most drug firms|expanding to China 


oncology. They subcontract different stages of 
the drug-development process to Zhangjiang’s 
myriad CROs, in which some 300 staff mem- 
bers are dedicated to Lilly’s work. The com- 
pany’s China unit, which officially opened its 
doors on 15 October, is calling itself a “fully 
integrated pharmaceutical network”. 

“In a smaller arganization, you know the bot- 
tom line much better and you know whether you 
made the right decision much more quickly,” 
says Tony Zhang, head of Eli Lilly’s drug-discov- 
ery unit. “You can consider what we are doing in 
China an experiment. We don't have to own eve- 

rything.” And is an experiment 


is a member, have mainly stud- “There are no success he believes will work. “There is 
ied white people. “In the United stories, nobody has a large pool of scientific talent, 
States we hit a wall” says Zang, . vibrant entrepreneurs, and good 
who was the research director of built an Amgen il , _ infrastructure,” he says. 

the Baylor College of Medicine Genentech in China. A similar experiment is being 
Multiple Sclerosis Comprehen- — Mingwei Wang planned by Pfizer. The firm 


sive Care Center in Houston, 

Texas, for almost 10 years. Asians seem to have 
a lower incidence of the disease and there is at 
least one clue as to why — susceptibility genes 
in the major histocompatibility complex some- 
times differ in patients grouped by race. “We will 
look at the genetics very closely,” he says. 


Business base 

Eli Lilly is also pouring millions of dollars 
into China — US$100 million over the next 
5 years — but it couldn't be spending it more 
differently. Its rented office in Zhangjiang Hi- 
Tech Park employs ten people, and none does 
scientific research. Instead, the experienced 
research scientists effectively figure out how 
to pay other companies to do drug discovery 
for them, focusing initially on diabetes and 
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already has a China Research 
and Development Center that focuses on drug 
development and dinical trials, but it is panning 
to launch a similar ‘virtual R&D network that 
will work through partnerships with academic 
institutions and CROs. Such efforts by Pfizer, 
Roche and other companies are like “extensions 
of global operations’, says Wang. By keeping the 
in-house operation small, “it’s easier to open and 
easier to close if things don't go well”: 

Being able to open, close or change direc- 
tion quickly is a moneysaver. Beatrijs Van 
Liedekerke, an associate director at Price- 
WaterhouseCoopers, Beijing, in charge of the 
business advisory group for the biotechnology, 
pharmaceuticals and medical-device industries, 
estimates that outsourcing in Chinese pharma- 
ceutical R&D saves some 30% in costs. “They 
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Started in China 


While pharmaceutical companies 
around the world are falling over 
themselves to invest in China, 
Chinese biotech companies are 
struggling to be taken seriously 
internationally. According to 
Mingwei Wang, director of 
China's National Center for Drug 
Screening, the 6,600 companies 
in existence earlier this decade 
has shrunk to fewer than 3,000 
after a law was implemented in 
2004 requiring that drugs be 
made in accordance with the 
“good manufacturing practices” 
that are standard elsewhere. The 
remaining firms generally make 
generic versions of patented 
drugs. Just 3% of the drugs that 
domestic companies get approved 
in China each year are ‘discovered 
in China’— and most of these are 
“new formulations of traditional 
Chinese medicine”, which aren't 
really new, says Wang. 


outsource because they don't have the expertise, 
they don’t want to invest, or they just have too 
many things going on,” she says. Gerald Chan, 
a co-founder of the Morningside, an investment 
company active in China, is blunter. “Drug 
companies had to do something to please Wall 
Street,” he says, “and that meant outsourcing” 

Roche, which in 2004 became one of the first 
multinationals to establish a research unit in 
Shanghai, is somewhere between the Eli Lilly 
and GSK models. Having spent some US$70 
million to $80 million over the past four years, 
it now employs around 90 people in house. But 
it also relies heavily on CROs and partnerships 
with universities that, taken together, give it a 
total of about 250 research staff, says Chen. 
“You can build a drug-design chain without 
leaving Zhangjiang,’ he says. 

The role of Roches Chinese branches is partly 
to adapt pre-existing drugs by verifying the 
appropriate dose for Chinese people — essential 
work if the company wants to gain regulatory 
approval for drugs it has developed abroad. But 
the company also wants to make this a two-way 
flow, by developing a programme to identify and 
evaluate new drug candidates in China as fa as 
phase II dinical trials, and pass them to Roche’s 
global portfolio. Roche's China branch already 
has two cancer drugs in the preclinical stage 
that are ready for human trials next year. Chen 
agrees that outsourcing speeds up progress. 
“You can do several steps in parallel, and flex- 
ible resources will reduce the waiting time when 
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Now, encouraged by 
government support, Chinese 
companies are slowly gearing 
up towards new drugs. But the 
handful of true discovery 
companies — usually 
biotech firms rather than 
pharmaceutical companies 
— still gain little traction with 
investors, potential partners 
in multinational companies 
or the US Food and Drug 
Administration (FDA). 

In 2002, when Qiang Yu 
set up Shanghai Ambrosia 
Pharmaceuticals with about 
20 million renminbi (US$2.5 
million) from the government, 
“there was no drug discovery” 
in China, he says. “It was a 
blank field.” Yu says he wants 
the first Chinese drug approved 
by the FDA to be his. The most 
promising drugs at his company 
are botanicals, screened from 


& 


you have numerous projects with multiple com- 
pounds,’ he says. “By picking different CROs to 
do the same services in parallel, whether it’s drug 
metabolism and pharmacokinetic studies, drug 
safety or oncology animal studies, pharmaceuti- 
cal companies can avoid a backlog at any given 
point in drug development.” 

Wang cautions that there are limitations 
on just how much can be outsourced. Screen- 
ing drug candidates for biological activity, for 
instance, requires experience that CROs are 
short on and this could be a concern when it 
comes to generating high-quality data. Very 
few of Zhangjiang’s company’s offer 
these types of screen. “CROs 
are passive. If you say 
4 weeks, they'll do it. But 
they won't spot early 


Eli Lilly is setting up a network of subcontractors 
in China. 
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traditional Chinese 

medicine libraries, 
which he claims work 
in combination to block tumour 
growth. “Multinationals don't want 
a botanical mixture. They want a 
single compound,” says Yu. 

Yu is now in the second phase 
of fundraising but says that he is 
struggling to find pharmaceutical 
partners or investors. “Everybody 
is looking for later-stage products,” 
he says. “They are demanding a 


lot of data.” 
More data might not be 
enough. Zensun (Shanghai) Sci 
& Tech, established in 2000 with 
200 million renminbi (US$30 
million) from government 
and private sources, has 
taken a heart-failure drug 
to phase Il trials at a fraction 
of the price it would have cost 
in the United States. But because 
the FDA sometimes doesn't 
accept phase II data obtained 
independently in China, the firm 
has not been able to convince 
potential investors of the drug's 
value. “When you apply to the 
FDA, they'll ask you to restart,” says 
Mingdong Zhou, Zensun’s chief 
executive. “It's a serious problem 
because investors evaluate us as if 
we were at preclinical stage, even 
though we are already in phase II or 
phase III clinical trials.” D.C. 


signs of deviation. It’s a recipe for disaster. To 
interpret the data, you must be experienced.” 

Van Liedekerke, who has been based in 
China for the past two and a half years, says 
that China has a ld going for it, but it won't be 
a “deus ex machina that solves all of pharma’s 
problems” It might solve some of them though. 
She gives an off-the-cuff estimate of 3-5 years 
before drugs discovered in China start making 
their way into health systems around the world. 
“It’s just a matter of time,” she says. 

Drug discovery and development is becom- 
ing more complicated both scientifically and 
managerially, she says, and requires people with 
expertise in many fields to come together. 

“The people that can communicate the 
best will win cut.” And whoever wins, 
less money will probably have been 
spent trying out new models in 
China than elsewhere. 

Chan estimates that a full- 
time chemist in the United States 
costs US$225,000 in salary and 

research costs per year compared 

with $70,000 in China. But the price 

is rising quickly owing to competition 

and rising prices. “Shanghai used to be 

the most desirable spot. Now we wouldn't 
dare touch it,” he says. 

“Prices in China are going up,’ Chien agrees. 
“Tt will not be a bargain for much longer? = ™ 
David Cyranoski is Nature's Asia Pacific 
correspondent. 
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Electronic voting machines were supposed to vanquish unreliable 
counts. They did not — but David Lindley finds that other technologies 


n the US mid-term election of 7 November 
2006, the balloting in Sarasota County, 
Florida, was decidedly high-tech. Voters 
recorded their choices on electronic touch- 
screen machines that had been installed follow- 
ing the debacle of Florida's 2000 presidential 
race between Republican George W. Bush and 
Democrat Al Gore. Then, recounts and legal 
actions left the United States uncertain of its 
next leader for more than a month before Bush 
was eventually declared winner of the state — 
and the country — by little more than 500 votes 
out of almost 6 million. The devices promised 
to prevent a repeat of that event, memorable for 
its images of officials solemnly peering at hang- 
ing, pregnant and dimpled chads on punch- 
card ballots, trying to decide which of them to 
count as tne votes. Surely with electronic vot- 
ing, any such ambiguity would be impossible? 
Evidently not. By the morning of 8 November, 
election officials were grappling with com- 
plaints about machine glitches, unrecorded 
votes and flipped’ votes allocated to the wrong 
candidate. Somehow 13% of voters apparently 
failed to register any choice at all in the closely 
fought congressional race between Republi- 
can Vern Buchanan and Democrat Christine 
Jennings. By the time the protests and the law- 
suits were over, and Buchanan was officially 
declared the winner by 369 votes, the conclu- 
sion was painfully clear: one set of problems 
had been traded for another. 
The effort to reform US voting technology 
has not been successful. Florida, for example, 


present their own problems. 


threw out its old mechanical and paper-based 
voting technology after 2000, and switched 
to electronic systems, aided in part by funds 
allocated by the 2002 Help America Vote Act 
(HAVA). Critics of the move were quick to say 
that electronic voting machines were vulnerable 
to hacking and other attacks that could allow 
people to change the election results. And then 
experiences in Sarasota and elsewhere seemed to 
suggest that electronic machines could produce 
untrustworthy counts in many other ways. 

As a result, Florida, New Mexico, Iowa and 
several other states have now rejected electronic 
voting and gone back to paper 
ballots that are marked by hand, 
typically by filling in a Hank oval, 
and scanned by machine. But the 
cure may be worse than the dis- 
ease — especially given the long 
history of paper ballots being lost, 
stolen, faked and stuffed into bal- 
lot boxes on the sly. “It's the most 
insecure medium there is,” says 
Paul Herrnson, a political scientist 
at the University of Maryland in 
College Park. 

Meanwhile, some worry that 
arguments over the actual and 
alleged flaws of electronic vot- 
ing systems have overshadowed 
more immediate concerns about usability and 
reliability. During the Florida 2000 election, 
for example, Palm Beach County’s infamous 
‘butterfly ballot’ — a split-page design with 
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“Paper is the most 
insecure medium 
there is.” 

— Paul Herrnson 


punch-card slots running down the middle — 
may have led a few thousand Gore supporters 
to accidentally vote for Reform candidate Pat 
Buchanan. Misleading ballot design, it seems, 
may also have caused problems in Sarasota. 

Adding to the confusion is the fact that in 
the United States, conducting elections is the 
responsibility of state and local governments. 
The result is a patchwork in which nothing is 
standard or uniform, something that the HAVA 
legislation did little to change. 

Voters heading to the polls on 4 Novem- 
ber could face any number of glitches and 
anomalies, and the possibility 
that, once again, the outcome of 
a close presidential race could be 
shrouded in uncertainty. 


Universal problem 

Controversies over electronic 
voting are not unique to the 
United States. The Netherlands 
embraced electronic voting in 
the 1970s, in part to deal with 
the complexity of the nationwide 
system of proportional repre- 
sentation by which members 
of parliament are elected. The 
Dutch system came into being 
before hacker culture was wide- 
spread, says Doug Jones, a computer scientist 
at the University of Iowa in lowa City who 
has observed elections in the Netherlands. 
But in light of recent concerns, the country 
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has decided to return to paper balloting. 
Brazil, by contrast, introduced electronic vot- 
ing beginning in 1996, and now has a fully 
electronic system. In the initial stages, as many 
as 7% of voters trying the new system were 
unable to record their choices electronically, 
but that figure fell to less than 0.2% by 2000, 
and the country remains committed to its 
voting technology. 

Lingering memories of the 2000 election, 
along with the chance of a close contest on 
4 November, make questions about voting 
technology especially 
urgent in the United 
States. The debate 
has been particularly 
contentious when 
it comes to direct 
recording electronic 
(DRE) devices, in 
which a voter's choice 
is translated immedi- 
ately into electronic 
data. Sarasota’s iVotronic voting machines 
—made by Election Systems & Software based 
in Omaha, Nebraska — generated no record 
of each vote other than the electronic data. (To 
satisfy a HAVA audit requirement, each one 
did print out a paper copy of its tally once the 
election was finished.) In Sarasota, therefore, 
there was “basically no evidence that could 
be examined after the fact that would explain 
what went wrong’, says Jones. 

Concerns that DRE machines are vulnerable 
to undetectable tampering took off in 2003, 
when a group of computer scientists induding 
Avi Rubin of Johns Hopkins University in Balti- 
more, Maryland, and Dan Wallach of Rice Uni- 
versity in Houston, Texas, published a report 
(T. Kohno et al. IEEE Symposium on Security 
and Privacy 9-12 May 2004) pointing out 
serious security flaws in the software run- 
ning the AccuVote-TS, a DRE system 


“Software fixes 
can really mess 
things up if 


you're making 

changes at the 

eleventh hour." 
— Avi Rubin 


made by Diebold based in North Canton, 
Ohio. (A copy of the software had been posted 
on the Internet some months earlier.) As a 
safeguard against malfunction or tampering, 
the authors recommended that DRE systems 
should generate voter-verified paper audit 
trails (VVPAT) — literally, a paper display on 
which a voter could check that his or her vote 
had been cast correctly, and which could be 
kept as a record of the vote should any irregu- 
larities come to light. 

Such a system was put in place in the May 
2006 primary elec- 
tions in Ohio, with 
printers being added 
to existing DRE sys- 
tems. It was not a great 
success. A study com- 
missioned by Ohio’s 
Cuyahoga C ounty 
found that almost 10% 
of the paper records 
were useless or absent, 
because, for example, the printers jammed, ran 
out of paper or overprinted. Ted Selker, who until 
June was at the Massachusetts Institute of Tech- 
nology (MIT) in Cambridge as co-director of the 
California Institute of Technology/MIT Voting 
Technology Project, says that he saw similar 
problems when he observed elections in Nevada 
in 2004, ae of the first occasions when a VVPAT 
system was added to electronic voting. 

So a DRE device will record many legitimate 
ballots for which no paper record, or an illeg- 
ible one, exists. That creates a further problem, 
says Michael Shamos, a computer scientist at 
the Carnegie Mellon University in Pittsburgh, 
Pennsylvania, who is also a lawyer. Election law 
generally states that the paper records, not the 

electronic data, constitute the legal ballot. 


Reliable touch- 
screen voting 
systems are still 
years away. 
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Although Rubin originally thought that 
adding VVPAT technology would make DRE 
systems workable, he now says that experience 
with paper-trail systems and poor implementa- 
tion by the vendors has killed even that hope. 
And in any case, he says, VVPAT systems do 
nothing to solve the fundamental problem with 
electronic voting, which is that it depends on 
the reliability of software. Rubin says he is less 
concerned about external hacking or tamper- 
ing than about software going wrong, or the 
risk of malicious actions from within voting- 
machine manufacturers. He is quick to add that 
he has never claimed that any manufacturer is 
corrupt, but thinks it’s foolish to use systems 
that depend on them not being corrupt. 


Lost in the system 

Mundane programming errors may present the 
main danger. In the March 2008 Ohio primary 
election, for example, poll workers found that 
votes were sporadically lost during the trans- 
fer of data from individual voting-machine 
memory cards to the central system; after dis- 
covering the problem, election officials were 
able to re-read the memory cards and establish 
the correct totals. At the time, Premier Election 
Solutions — the name under which the voting 
machine division of Diebold now operates — 
claimed that the data loss resulted from a soft- 
ware clash with an antivirus program on the 
central system. On 19 August, however, Pre- 
mier acknowledged in a letter to Ohio secretary 
of state Jennifer Brunner that it had found an 
internal software bug that could cause votes 
to be dropped when data from two memory 
cards were being read at the same time. Elec- 
tion officials have now circulated guidelines to 
help poll workers to circumvent the problem 
without altering the voting equipment. 

But even if Premier devised a software patch 
that repaired the glitch, says Shamos, the com- 
pany couldn't install it without losing cer- 
tification of its system for use in federal 

elections. The HAVA legislation vested 
authority for such certification in a new 
Election Assistance Commission (EAC), 
which issued revised guidelines for the 

conduct of elections in December 2005 and 
took over the federal certification procedure 
in January 2007. That certification is voluntary 
— not every state complies — but when they 
do, it is not a quick process. Of the eight voting 
systems currently under evaluation, none has 
yet received EAC certification. Systems now in 
use were accredited under the old procedure, 
which was overseen by the National Associa- 
tion of State Election Directors. 

Shamos argues for an accelerated process to 
certify small changes, “especially when it pre- 
vents a much more serious problem”. But Rubin 
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Electronic voting has caused 
confusion for vendors, election 
officials and voters, but is it still 
more promising than paper? 
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thinks that is unzealistic. He says that even large 
software companies, despite all the testing and 
checking they do, keep releasing new versions 
of software as bugs and problems are fixed, and 
then the fixes give rise to new problems that 
require additional fixes. A quick turnaround is 
“an opportunity to really mess things up if you're 
making changes at the eleventh hour’, he says. 

The certification process also impinges on 
the question of software disclosure. Both critics 
and advocates of DRE systems generally argue 
that companies should make public the soft- 
ware they use, which would give them greater 
incentive to resolve security issues. Vendors can 
still protect themselves by patenting their ideas 
and holding copyright to their code, Wallach 
says, and disclosure should not generate secu- 
rity issues for well designed software. “Only if 
their systems are built like garbage does disclo- 
sure become a security problem.” 

But then, as Rubin points out, voting-machine 
design “needs to be set in concrete in order to 
be certified”. He imagines the case of a software 
security flaw or bug being discovered just before 
an election. “If you try to fix it, you will now be 
using an uncertified voting system and that 
would be an avenue for someone to tamper with 
the votes; he says, but if you dontt fix it “you're 
going to use a system that you know is bad’. 


Design errors 

Another concern about voting machines is 
that they all too often neglect basic human 
factors. For example, use of a printed paper 
record tacitly assumes that a voter confronted 
with an incorrect paper ballot will reject it and 
ask to start again. But Selker says that in prac- 
tice not many voters pay attention to the paper 
printout, or understand its purpose. 

That conclusion is supported by Sarah 
Everett's PhD thesis at Rice University. Ever- 
ett conducted mock elections to compare the 
performance of various voting technologies, 


including punch cards, paper ballots and a 
touch-screen interface designed by Wallach 
and his colleagues. In some experiments, 
deliberate errors — omitting an election race, 
or flipping a voter’s selection — were intro- 
duced on the electronic system’s final review 
screen, which subjects were asked to check 
before they cast their ballot. Almost all par- 
ticipants said that the review screen gave them 
more confidence that their votes had been 
correctly registered. Only about one-third 
of them, however, actually noticed that their 
votes had been changed. 

Of course, Jones 
says, if even just a 
few people verify | ivr nici 
their vote, such Che stim 
review systems 
can still provide 
warning of sys- 
tematic problems, 
as long as election 
officials act. In 
the Sarasota case, 
there was “ample |“ 
evidence t hat 
something was 
wrong before polls 
officially opened”. 
Difficulties with 
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The ballot design may have led some voters in Sarasota 
County to miss the congressional race at the top. 


produced no conclusive evidence of machineS 
malfunctioning, although Wallach says thaté 
the tests did not address the full range of trou-2 
bles that voters reported. “We have all kinds? 
of evidence of touch-screen malfunction and> 
miscalibration,” he says. Touch screens must& 
be calibrated so that they correctly connect thez 
place where a voter touches the screen with thed 
corresponding spot on the displayed ballot. © 

The Sarasota Herald-Tribune, however, 
suggested poor ballot design as another cause 
of the undervote. The Buchanan-Jennings 
race, in which there were no other candidates, 
appeared at the top of, 
a screen, separated 
from the governor/3 
lieutenant governors 
race by a blue ban- 
ner. The suggestion 
was that a voter's eye 
would be drawn to 
the banner and the 
large race following 
it, and could easily 
overlook the smaller 
race at the top. 

Selker tested this by 
running mock elec- 
tions using the Sara- 
sota ballot screens. 


SOWERNOR 


the machines were 
flagged by early voters, who under Florida law 
can vote ahead of election day, but their com- 
plaints brought no change in voting procedure. 
The Sarasota ‘undervote’ came almost 
entirely from the 120,000 people who voted 
electronically, of whom almost 18,000 failed 
to register a choice in the Buchanan-Jennings 
congressional race. Among the roughly 20,000 
absentee voters who sent in optical scan bal- 
lots, the undervote was a more typical 2.5%. 
Post-election testing by Florida officials and 
later by the Government Accountability Office 
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He found that more 
than 16% of the test subjects missed the con- 
gressional race. When he put the race at the 
bottom of the previous page instead, almost 
19% missed it. 

Selker also has an explanation for the claims 
of vote flipping. He cites a study by Sarah 
Sled of the Caltech/MIT Voting Technology 
Project of the 2003 election that made Amold 
Schwarzenegger governor of California. Of 135 
candidates, only Schwarzenegger and two oth- 
ers got a substantial number of votes. The six 
candidates immediately above or below one of 
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the three major candidates on the ballot did 
slightly but significantly better than the other 
minor candidates. Sled’s conclusion was that 
about 0.4% of voters inadvertently voted for a 
candidate adjacent to their true choice. 

Claims of vote flipping arise, Selker main- 
tains, when voters mistakenly choose the 
wrong candidate and then, when they see a 
confirmation screen, assume that the machine 
has erred. But that’s a point in favour of well- 
designed DRE devices, he argues: they give vot- 
ers an immediate opportunity to review and 
correct their choices. 

DRE systems are also easy to use. Herrnson 
and his colleagues conducted experiments in 
which volunteers worked through a moderately 
complex mock election on a variety of electronic 
systems, and with a hand-marked paper ballot. 
Volunteers rated the systems on a number of 
criteria, including ease of use, ability to correct a 
mistake and confidence that their choices were 
accurately recorded. Two of the three touch- 
screen systems got most of the high marks; the 
third scored less well largely because it jumped 
to the next screen without waiting for voters to 
signal that they were ready to proceed. 


Judged too early? 

Shamos points out other advantages of DRE 
systems. They can more easily guide voters 
through complex ballots, can provide the bal- 
lot in a variety of languages, can adjust font 
sizes for voters with visual difficulties and are 
generally better for disabled voters. He says 
that the alarm raised over software security 
“was a perfectly good message, but it turned 
into a campaign of incredible vituperation 
against the concept of a compu- 
ter being used in voting”. 

As a safeguard against the 
manipulation of electronic votes, 
Shamos says that the data repre- 
senting the touch-screen image 
could flow directly into a non- 
rewritable recording medium. That record 
could be used to reconstruct the voter's opera- 
tions on the touch screen. 

Jones, however, is sceptical. Independent data 
capture has promise, he says, but ‘can you really 
show to me that it is independent?” What these 
systems aim to achieve, he says, is the electronic 
equivalent of a “camera over the shoulder”, 
directly recording the voter's actions, but such 
techniques require data transfer and storage, 
and often end up being hackable themselves. 

Apart from their technical challenges, 
proposals to improve security and provide 
trustworthy data back-up increase the cost of 
voting systems and make life more difficult for 
volunteer poll workers, whose training is often 
perfunctory. In addition, says Jones, election 
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“Paper-ballot 
systems also have 


shortcomings.” 
— Doug Jones 


officials themselves are intimidated 
by novel equipment, and leave 
instruction and trouble-shooting 
to representatives of the vendor. 
Concerns over the expense of 
voting equipment, along with the 
uncertain legislative atmosphere, 
combine to make “a treacherous 
marketplace’, says Peter Lich- 
tenheld, a spokesman for Hart 
InterCivic in Austin, Texas, which 
makes both DRE and optical-scan 
voting systems. 

There is still no sign of resolution. 
Critics of DRE systems concede 
that no voting system is per- 
fect — but insist that elec- 
tronic voting is uniquely 
susceptible to failures 
with catastrophic, 
system-wide con- 
sequences. Wallach 
argues, for example, 
that defrauding a 
paper ballot is labour- 
intensive, whereas for 
a hacker who has the 
ingenuity, and the means, to 
mount an attack on electronic 
systems, “the cost to throw an elec- 
tion is dirt cheap”. But Herrnson counters that 
that distinction is not as stark as it might seem. 
Anyone wishing to throw an election the old- 
fashioned way would pick on close races and 
would know where tampering with a few ballot 
boxes would swing the result. 

“Some of the critics of DRE voting machines 
uncritically recommend paper- 
ballot systems without under- 
standing that they too have 
shortcomings,” says Jones. 
Even so, he thinks that optically 
scanned paper ballots represent 
the best compromise. Paper 
ballots are easy to understand, he says, and 
although they are far from immune to fraud, 
“the frauds are understandable and the defences 
against the frauds are understandable”. 

For now, Rubin endorses the same system, 
but “not because of some love for paper sys- 
tems’, he says. “It’s just that I think DRE sys- 
tems have too many inherent problems.” 

Florida has now embraced optically scanned 
paper ballots. But on 26 August, the Sara- 
sota Herald- Tribune reported that a Premier 
scanner being used to read absentee ballots 
could not upload its data to the central tally- 
ing system, and vote totals had to be entered 
manually. And in the District of Columbia, 
which uses optical scanning technology from 
Sequoia Voting Systems, based in San Leandro, 
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Electronic votes are stored on 
memory cards before counting. 


California, early vote counts in a 
9 September primary election for the 
city council were inflated by thousands of 
phantom votes that disappeared in subsequent 
election reports. The cause of the problem has 
not been definitively identified. 

Moreover, Herrnson warns, the return to 
paper ballots inevitably raises the spectre of 
a repeat of Florida 2000, with a close result 
spawning unresolvable arguments over what 
does and does not constitute a legitimate mark. 
In place of the relentless focus on the flaws of 
electronic voting, he says, what is needed is a 
dispassionate comparison of voting systems 
taking security, reliability and usability into 
account. “The debate would benefit from a 
panel of election administrators and com- 
puter people who aren't bound to one posi- 
tion,” he says. What's also needed, says Jones, 
is patience. In recent years, the discovery of 
electoral flaws has led to “irresponsible and 
unrealistic” expectations that new and better 
equipment can be put into place in a matter 
of months. Vendors need two years to develop 
voting systems to meet new standards, he says, 
and election officials need another two years to 
get the systems working reliably. 

That kind of time might be available — as 
long as this November’s election doesn’t end 
in contention and acrimony. a 
David Lindley is a freelance science writer 
based in Alexandria, Virginia. 

See Editorial, page 1149. 
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Paradox of flying to 
meetings to protect 
the environment 


SIR — Scientists are becoming 
increasingly concerned about the 
environmental impact of their 
work. For example, in the latest of 
many recent reflections on the 
subject, bioinformatician Hervé 
Philippe urges us to uncouple 
scientific progress from economic 
growth (Trends Genet. 24, 
265-267; 2008). He adds his 
voice to those advising us to 
reduce our carbon footprint 

by attending fewer scientific 
conferences (see also the Editorial 
‘Meeting expectations’ Nature 
455, 836; 2008). 

Regular long-distance flying can 
easily triple an academic’s carbon 
footprint. During the past year, 
| have ‘spent’ about nine tonnes 
of carbon, two-thirds of this on 
plane trips. Yet |am a good 
consumer otherwise (see www. 
carbonfootprint.com), and | 
don't even own a car. Such figures 
are particularly hard for field 
ecologists to stomach, as we hope 
our long-term work will highlight 
the environmental consequences 
of climate change and may 
ultimately influence the public 
and policy-makers. 

Take, for instance, the dynamic 
field of conservation biology. 
Most of its best researchers are 
based at universities in the 
Northern Hemisphere, but most 
of their field sites are located in 
developing countries in the 
south. These hotshots and their 
students use up tonnes of fuel 
each year in commuting trips. 

In fact, those who are particularly 
renowned and most involved 

in environmental politics 
become ‘constant fliers’ who 

are always jetting off to field 
sites and meetings. 

| estimate that such behaviour 
can potentially increase an 
individual's carbon footprint to 
ten times their national average. 
Of course, plenty of businessmen 
have similar or larger carbon 
footprints, but few of them would 
claim that their journeys are good 


for the environment. One is 
left wondering whether the 
carbon footprints of ecologists 
outweigh the environmental 
benefits of their findings and 
of their lobbying. 

One way around this problem 
is to promote conservation 
biology on site, with improved 
local academic training (see, 
for example, http://tinyurl. 
com/65kh9k), local research 
leaders and fewer but longer 
stays by foreign visitors. With 
regard to environmental politics, 
one faces the classic dilemma 
between personal restraint (| stay 
at home and work in the garden) 
and energy-demanding public 
involvement (I fly daily to help ban 
overfishing). 

The outcome is a personal 
decision that may be dictated 
more by ambition than by 
environmental awareness. 
Nevertheless, as a German 
environmental campaigner told 
me 15 years ago, “Industry would 
be all too pleased if we did not 
attend distant meetings because 
we refuse to board aeroplanes.” 
David Grémillet Centre d'Ecologie 
Fonctionnelle et Evolutive — CNRS, 
1919 route de Mende, F-34293 
Montpellier, France, and FitzPatrick 
Institute of African Ornithology, 
University of Cape Town, Rondebosch 
7701, South Africa 
e-mail: david.gremillet@cefe.cnrs.fr 


Readers are welcome to comment 
at http://tinyurl.com/Shqhw. 


Science students 
need to develop the 
skills of artists 


SIR — In your ‘Big data’ issue, 
Felice Frankel and Rosalind 

Reid call for an “investment in 
visual communication training 
for young scientists” to prepare 
them for modern methods of 
data representation (‘Distilling 
meaning from data’ Nature 455, 
30; 2008). But the problem goes 
deeper than that. 

Graphic artists who collaborate 
with scientists have often 
been shaped by the other of 
C. P. Snow's ‘two cultures’. 
Although well-intentioned, 
many artists’ understanding 
of basic science is inadequate 
for meaningful participation 
in high-level technical work. 
Cognitive art is like commercial 
art and technical writing: it has 
never garnered respect from the 
artistic establishment, and its 
practitioners are left to fend for 
themselves. 

From the start of schooling, 
distinctions are made between 
students with a talent for science 
and those with leanings towards 
the arts. In our technology-focused 
society, science receives more 
attention and an emphasis that 
does not include visual-thinking 
skills. Calculus, for example, 
is learned through symbolic 
operations, but portraying those 
procedures by using curves 
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and surfaces and tangents and 
intercepts is typically considered 
to be an unnecessary frill. 

Thus the two cultures diverge, 
and if we try to reassemble them 
later to let one benefit the other, 
we have serious difficulties: 
the world views don’t match. 
Subjective ideas can be stifled 
by objective thought but, by the 
same token, physical reality can 
be mismanaged by well-meaning 
attempts at creativity. 

We must indeed invest in 
visualization skills for science- 
bound students, but there should 
be a parallel path for science- 
illustrators-to-be to learn the 
basics of physics, chemistry and 
mathematics. Collaborators who 
understand each other's language 
have a much better chance of 
finding the common ground 
they need for the cooperation 
they seek. 

Kenneth R. Jolls Chemical and 
Biological Engineering, lowa State 
University, 2114 Sweeney Hall, Ames, 
lowa 50011-2230, USA 

e-mail: jolls@iastate.edu 


Military government 
support of science in 
Pakistan is illusory 


SIR — Your Editorial ‘After 
Musharraf’ (Nature 454, 1030; 
2008) seems biased in its 
comments on the role of elected 
governments in Pakistan. You note 
that the military has ruled the 
country for longer than civilians 
have, yet you blame civilian 
governments for the lack of 
development of science. 

The military governments 
enjoyed unaccountable and 
unchallenged power. For example, 
they were not required to reveal 
details of their defence budgets, 
and military pensions were passed 
over to the civilian budget when 
international agencies applied 
pressure for the military budget 
to be cut down (using the 
argument that retired military 
personnel are civilians). 

Under General Pervez 
Musharraf's rule, generals were 
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appointed as vice-chancellors of 
many of Pakistan's universities, 
with adverse effects on the 
morale of academicians and 
researchers. None of the military 
governments made a serious 
effort to promote science. Against 
this background, it is surprising 
to read: “Worryingly, Pakistan's 
governance of its science seems 
all set to revert to the situation 
that prevailed under previous 
elected governments.” 

It would be interesting to 
compare the rise in the military 
budgets under military and 
elected governments with the 
budgets for education and health. 
Neither military nor elected 
governments have ever dared 
to reduce the military budget in 
real terms. 

Muhammad Naim Siddiqi, Abdul 
Wahab Yusufzai Department of 
Psychiatry, Aga Khan University, 

PO Box 3500, Karachi 74800, Pakistan 
e-mail: naim.siddiqi@aku.edu 


Doping: world agency 


sets standards to 
promote fair play 


SIR — We at the World Anti- 
Doping Agency (WADA) 
find parts of Donald Berry's 
Commentary ‘The science of 
doping’ (Nature 454, 692-693; 
2008) potentially damaging to 
the fight against doping in sports. 
WADA‘ accredited laboratories, 
including the French national 
laboratory LNDD (which handled 
the Floyd Landis case), must meet 
standards set by the International 
Standard for Laboratories (ISL) 
in validation methods, staff 
competency and chain of custody, 
for example. Compliance is 
assessed independently by bodies 
of the International Laboratory 
Accreditation Cooperation. 
Berry casts doubt on the 
laboratory component of 
the anti-doping procedure 
as a whole, but he fails to 
mention that the majority of 
the substances reported by 
the anti-doping laboratories 
are exogenous substances not 
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naturally present in human 
beings. The development 

of testing procedures for 
endogenous substances includes 
samples from normal reference 
populations and from subjects 
administered with the substance 
under investigation, so that 
test-sample status and positivity 
criteria can be established. To 
determine cut-offs for the ratio of 
testosterone to epitestosterone 
(T/E), tens of thousands of 
athlete samples were analysed 
to establish reference values. To 
detect exogenous administration 
of endogenous substances 
(such as pharmaceutical 
testosterone) by isotope- 

ratio mass spectrometry 
(IRMS), validation is based on 

a combination of hundreds of 
known positive and negative 
samples analysed by many 
WADA anti-doping laboratories 
operating under the scrutiny of 
the ISL and of the International 
Organization for Standardization 
(such as ISO 17025). 

Each sample test includes 
positive and negative quality- 
control samples to assess the 
possibility of a false result. If these 
samples fail, the test must be 
repeated. An adverse analytical 
finding is not reported unless the 
quality criteria are met. 

Laboratories participate in at 
least four rounds of blind and one 
double-blind proficiency test per 
year; the results of each round 
determine the accreditation 
status of the laboratory. False 
positives mean immediate 
revocation of accreditation. 

Berry's implication that the 
results of T/E ratio and IRMS 
analyses are interdependent is 
not altogether correct: according 
to the WADA list of prohibited 
substances, IRMS analysis stands 
alone as the basis of an exogenous 
testosterone finding. This is 
supported by legal precedents. 

Contrary to Berry's suggestion 
that anti-doping tests may not 
be sufficiently specific, mass- 
spectrometry identification of 
exogenous substances relies 
onat least three diagnostic 
ions to avoid any interference 


or misidentification. For 
immunoassays, antibodies in the 
initial testing and confirmation 
procedures must have different 
antigen-epitope specificity. For 
analytes that are too small to 

have two independent antigenic 
epitopes, two different purification 
methods or two different 
analytical methods are used. 

WADA encourages its 
accredited laboratories to publish 
in peer-reviewed journals. 
Although complete information 
cannot be released without 
compromising the efficacy 
of an anti-doping test, this is 
made available to legal panels 
on request. 

WADA’s mandate is not to 
foster “a sporting culture of 
suspicion, secrecy and fear”, 
as suggested by your Editorial 
(Nature 454, 667; 2008), but to 
promote fair play by protecting 
clean athletes. 

Arne Ljungqvist, Luis Horta, Gary 
Wadler World Anti-Doping Agency, 
Stock Exchange Tower, 800 Place 
Victoria, Suite 1700, PO Box 120, 
Montreal, Quebec H4Z 1B7, Canada 
e-mail: violet.maziar@wada-ama.org 


Doping: using flexible 
criteria could reduce 
false positives 


SIR — In his Commentary ‘The 
science of doping’ (Nature 454, 
692-693; 2008), Donald Berry 
uses data from the Floyd Landis 
case as grounds for his claim 
that drug-testing practices are 
often based on flawed statistics. 
His concerns stand in contrast 
to the US Anti-Doping Agency's 
view of the outcome of the case 
(quoting from R. Mukhopadhyay 
and J. Griffiths Anal. Chem. 79, 
8823-8825; 2007): “It's really 
easy to play Monday-morning 
quarterback and see an i that's 
not dotted or at that's not 
crossed, but that inno way 
undermined the validity or the 
reliability of the work that was 
done by the French lab.” 

This quote seems frivolous 
in the context of a class of 
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drug-testing practices. These 
are based on separation of 
asample’s constituents by 
chromatography and detection 
of target compounds by mass 
spectrometry, one of the work- 
horses in anti-doping research. 
Identification is reported as 
positive when the test and 
reference sample signals agree 
within a particular tolerance 
window. 

The size of this window is not 
constructed with an acceptable 
risk of false positives in mind. 
Rather, fixed decision criteria hold, 
regardless of the quality of the 
laboratory or the signal properties 
of the target compound. However, 
alaboratory that produces 
relatively precise results should 
deploy stricter criteria. Likewise, 
target compounds should be 
differentiated so that information 
in their signals can be respected. 

Always deploying the same 
rigid criteria leads to a probability 
of false positives that depends 
on the particular laboratory and 
target compound in an undefined 
way. This situation is frustrating 
because the statistical solution 
— flexible criteria that account 
for various complications — was 
already published and thoroughly 
tested five years before these rigid 
criteria were introduced. 

So, is it ignorance of the 
literature or failure to understand 
the analytical problem at hand 
that underlies the ongoing usage 
of these arbitrary decision rules? 
Laboratories, as well as their 
clients and (re-)accrediting 
organizations, should start 
reflecting on their accountability 
with regard to this avoidable 
malpractice. 

N. M. Faber Chemometry Consultancy, 
Goudenregenstraat 6, 6573 XN 
Beek-Ubbergen, The Netherlands 
e-mail: nmf@chemometry.com 


Contributions may be submitted 
to correspondence@nature. 
com. Published contributions 
are edited. Issues of interest to 
authors are regularly featured at 
Nautilus (http://blogs.nature. 
com/nautilus), where we 
welcome comments and debate. 
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Modest heroines of time and space 


The astronomical activities of two women helped to open up the heavens and ensure that a city ticked 
to the same time. Patricia Fara asks if their laborious work was valued. 


Ruth Belville: The Greenwich Time Lady 

by David Rooney 

National Maritime Museum: 2008. 192 pp. 
E299) 


The Georgian Star: How William and 

Caroline Herschel Revolutionized our 
Understanding of the Cosmos 

by Michael D. Lemonick 

W. W. Norton: 2008. 178 pp. $23.95 


When writers want to make science history 
sound exciting, they often focus on inven- 
tions, using military vocabulary to enthuse 
about major breakthroughs, victories over 
illness or unprecedented advances along 
the upward path of progress. Dava Sobel’s 
Longitude (Walker, 1995) captivated millions 
of readers who enjoyed believing that a single 
inspired innovation — John Harrison's clock 
— had solved the greatest scientific problem 
of the age. Other historians are more circum- 
spect. In the opinion of David Rooney, author 
of Ruth Belville: The Greenwich Time Lady and 
curator of timekeeping at the Royal Observa- 
tory in Greenwich, London, “New technolo- 
gies don't simply replace old ones... they just 
add another layer of complexity to our lives.” 
Decades after electrical time signals were 
introduced, he explains, many public clocks 
still operated mechanically, telling different 
times and so generating confusion rather than 
imparting information. How could passers-by 
know which of them, if any, was right? 

An energetic and enterprising researcher, 
Rooney writes with verve, successfully infus- 
ing dramatic suspense into the everyday 
mundanity of horological history. He opens 
his saga in 1811 with a youth calling himself 
John Henry: his mother had fled to London 
from revolutionary France, but he concealed 
his foreign family name of Belville to secure a 
government job as an astronomical assistant. 
The book’s modest heroine is John’s daughter 
Ruth, who died in 1943 from gas poisoning 
after conscientiously turning down the lamp 
to save fuel. In between these two dates — 
an extraordinarily long stretch for only two 
generations — Rooney presents some of the 
fundamental stages during the agreement of 
universal time, spiced up with gory accounts 
of Jack the Ripper and a nineteenth-century 
terrorist whose efforts “to stop time” by blow- 
ing up a clock apparently initiated a chain of 
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The work of Ruth Belville (above right) let London's clocks display the correct time. 


events that influenced the twentieth-century 
Unabomber, Theodore Kaczynski. 

The delight of Rooney’s book lies not in such 
epoch-making events, but in his meticulous 
uncovering of the ordinary lives and activities 
that survived unchanged right through episodes 
of technological innovation. Before the railway 
system was introduced, synchronizing clocks 
in different parts of the country was unimpor- 
tant. But after daily life had become structured 
by rail timetables, it became essential to make 
sure that people all over Britain, and eventu- 
ally the entire world, were running to the same 
time. It was only in 1884 that international 
agreement was reached on Greenwich as the 
prime meridian, the zero line of world time. 
At the Royal Observatory, astronomers used 
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their high-precision instruments to determine 
the correct time. For around a century, the task 
of the Belville family was to ensure that this 
pronouncement was regularly and accurately 
communicated to other horological centres in 
London, such as watchmakers, factories and 
docks. 

Ruth Belville spent much of her life travel- 
ling around the city, carrying in her handbag an 
eighteenth-century chronometer that she took 
to Greenwich every Monday morning so that 
it could be checked for accuracy against the 
master clock and so act as a portable standard. 
Well into her seventies, she trudged along the 
streets, a mobile purveyor of absolute time who 
was cheaper and more reliable than contempo- 
rary radios. She supplemented this service by 
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The forthcoming US 
presidential election has 
spawned books about the 
science of voting. Political 
decisions can be made 
more democratic using 
social choice and game 
theory, allowing voters to 
express their preferences 
and enabling public 
goods to be divided fairly, 
explains Steven J. Brams in Mathematics 
and Democracy: Designing Better Voting 
and Fair-Division Procedures (Princeton 
Univ. Press, 2008). Brams has also 
updated his classic 1978 book, The 
Presidential Election Game (A. K. Peters, 
2008), which dissects presidential 
campaign strategies. 


Engaging emotions is 
key to winning votes 
and political arguments, 
4 argues George Lakoff in 
4 The Political Mind: Why 
You Can't Understand 
2Ist-Century American 
Politics with an 
18th-Century Brain 
(Viking, 2008). He 
suggests that fact-laden 
Democrat campaigns could benefit 
from using emotionally loaded phrases 
such as ‘tax relief’ or ‘war on terror’. 


GEORGE LAKOFF 


ovr AN ELEPEAT! 


POLITICAL MIND 


Why Yer 


In Gaming the Vote: Why Elections 
Aren't Fair (and What We Can Do About 
It) (Hill & Wang, 2008), William 
Poundstone seeks out the fairest voting 
systems and explains why the most 
popular candidate may not always win. 


Facing decisions on topics from 
nuclear terrorism to climate change, 
the new president might consult 
Richard A. Muller's Physics For Future 
Presidents: The Science Behind the 
Headlines (W. W. Norton, 2008). Or 
he might turn to Joseph S. Nye’s 

The Powers to Lead (Oxford Univ. Press, 
2008) for tips on how to mimic the 
best leadership styles. 


And for perking up that speech, he 
might consider reading Tippecanoe 
and Tyler Too: Famous Slogans and 
Catchphrases in American History 
(Chicago Univ. Press, 2008) by JanR. 
Van Meter, for the stories behind 50 
famous phrases, such as “nice guys 
finish last". 


ELECTION READING 
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telephoning individual customers, but in 1936 
the speaking clock was introduced. Known as 
TIM, it gave ordinary people immediate access 
to standardized time. In its first year, TIM took 
20 million calls in london alone, from custom- 
ers who dialled up to hear the recorded voice 
of Ethel Cain, an exchange operator picked 
out at an audition after her mellifluous read- 
ing of a poem by John Milton. Although there 
have been several other human announcers 
since then, this telephonic clock still speaks 
on, coexisting with more recent timekeepers 
that continuously set themselves automatically. 
As Rooney concludes, “New isn't necessarily 
better, it’s just different ... stuff endures.” 

Rooney has the rare gift of combining the 
obsessiveness of an academic sleuth with 
the fluency of a detective novelist: not many 
people would attempt to write entertainingly 
about how the dot of the Morse code became 
the pip of a time signal With its small format 
and subdued cover, Rooney’s book super- 
ficially resembles Michael Lemonick’s book 
The Georgian Star, which also uses an engaging 
and informative style to present a combina- 
tion of history and science. Like Rooney’s, 
Lemonick’s subjects are two close relatives — 
in this case, the siblings William and Caroline 
Herschel — who also relied on an accurate 
scientific instrument, a telescope. There are 
three Georgian ‘stars’ in this relationship: two 
astronomers, here cast in hewic mode, and the 
planet Uranus, originally named ‘George’s Star’ 
by William Herschel to gratify King George III 
of Great Britain and Ireland. 

The Herschels worked together on the 
massive telescopes that helped William to 
become famous for his discovery of a new 
planet and for his work on nebulae. Discon- 
certingly for modern readers with egalitarian 
aspirations, Caroline seems to have colluded 
in her own marginalization, protesting that 
“Tam nothing, I have done nothing... a well- 
trained puppy-dog would have done as much”. 


Caroline’s reputation improved towards the 
end of the twentieth century, when feminists 
commemorated her as the first woman to dis- 
cover a new comet, even though this is of little 
scientific significance when compared with the 
collaboration with her brother and the star cat- 
alogue that she laboriously compiled. Caroline 
features prominently throughout Lemonick’s 
book, although by the epilogue she has been 
reduced to a cranky old woman who helps her 
nephew, the Victorian astronomer John Her- 
schel, to assume the mantle of genius passed 
down from his father. 

Whereas Rooney presents his own origi- 
nal research in an accessible way, Lemonick 
behaves more like a journalist providing a 
colourful version of well-known historical 
and scientific material. His major source of 
information is the astronomy expert Michael 
Hoskin of the University of Cambridge, UK, 
and Lemonick repeatedly portrays himself as 
a reporter transmitting the privileged con- 
versations he enjoyed in Hoskin’s Cambridge 
home. Hoskin is indeed a leading authority 
on the Herschels, but Lemonick unsceptically 
attributes to him the ability “to go beyond the 
straightforward facts... and to understand 
their complex and remarkable personalities”. 
At the end of The Georgian Star, Lemonick 
approvingly reproduces William Herschel’s 
boast that, thanks to his telescope, “I have 
looked further into space than ever human 
being did before me.’ In contrast with this 
eulogy of a powerful instrument, Rooney’s 
closing remark is that Ruth Belville “provided 
what no electrical wire could: the personal 
touch” Because she supplied what her aistom- 
ers wanted, her service outlasted sophisticated 
inventions. a 
Patricia Fara is the senior tutor of Clare College, 
University of Cambridge, Cambridge CB2 1TL, 
UK. Her forthcoming book is Science: A Four 
Thousand Year History. 
e-mail: pfl10006@cam.ac.uk 


No species is an island 


The Loom of Life: Unravelling Ecosystems 
by Menno Schilthuizen 
Springer: 2008. 220 pp. £37.50 


In the summer of 1966, Harvard biologist 
Edward O. Wilson and his student Daniel 
Simberloff undertook a dassic ecology experi- 
ment in Florida. They identified a number of 
minuscule mangrove islands, took a census 
of their mostly insect fauna, and then paid 
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an exterminator to kill all the animals on 
the islands with methyl bromide. Observing 
the islets as they became repopulated, they 
found that, after eight months, nearly all had 
regained the same number of species as they 
had hosted before the extermination. But 
most of these inhabitants were not of the same 
species as before. 

The scientists’ interventions confirmed the 
theory of island biogeography that Wilson and 
Robert MacArthur had published some years 
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earlier. It states that the number of species on an 
island depends on the rates of immigration and 
extinction, which are influenced by the distance 
of an island from the mainland and the island’s 
size. Notably, the number does not depend on 
what the species are, nor on the roles they have 
in their ecosystems, known as niches. 

Evolutionary biologist and science writer 
Menno Schilthuizen retells the story of the 
Wilson-Simberloff experiment at some length 
in The Loom of Life. His book is a readable, 
anecdotal introduction to the ecology of diver- 
sity, addressing basic questions about why 
there are so many different species and why 
some species are rare and others common. 

Yet the book is hard to get into. It lacks 
a straightforward introduction and, with 
uninformative chapter titles such as ‘The 
more the merrier’, a reader who idly picks it 
up would need to read 50 pages to understand 
where all the stories are going. It is not until 
this point in the book — after touring through 
the scientific usefulness of isolated ecosys- 
tems, the history of the idea of food chains, 
the staggering amount of global biodiversity, 
and the idea that each species inhabits a 
slightly different niche — that the first hints 
of dispute enter the plot. 

Schilthuizen sets out the idea that two species 
can coexist in the same ecosystem only if they 
have different niches. Evolutionary theory 
predicts that two species with the same living 
requirements and appetites would go head-to- 
head, and if one is even infinitesimally more 
efficient at exploiting their shared niche, it will, 
over the course of generations, outcompete 
and exterminate the other. Then he drops 
the bombshell. What about phytoplankton? 
Any one patch of water contains hundreds of 


different species, all swimming in the same 
water, all competing for the same energy 
source: sunlight. Even with the odd difference 
in mineral uptake, how could so many niches 
exist in the uniform surface of the sea? Ecolo- 
gists could go mad trying to find the minute 
niche differences at work; maybe there is an 
easier way. 

After launching into the history of island 
biogeography, Schilthuizen introduces 
Stephen Hubbell’s unified neutral theory of 
biodiversity, which Schilthuizen describes 
as aiming “to mimic and predict real, broad 
brush biodiversity patterns while ignoring the 


niches that undeniably existed”. The theory 
puts forth a simple predictor of the abundance 
of various species in an ecosystem using only 
the rate of migration and a fundamental 
biodiversity number that takes into account 
the rate of speciation. In general, the theory 
shows how abundances of each species drift 
up and down. Sometimes new species enter 
the system, occasionally one will go extinct, 
and now and then one will speciate into two. 

It is odd that the book does not give a pre- 
cise definition of ‘neutral’ in the neutral theory, 
namely the assumption that any differences 
between the behaviour of the various species 
being considered — all the trees in a brest, say, 
or all the corals on a reef — have no overall 
effect in the model. 

Clearly a fan of the theory, Schilthuizen 
makes time fr its opponents, even if he undiar- 
itably calls their negative reactions “kneejerks’. 
He sets out the evidence from both sides but 
avoids hashing it out, jumping on instead to 
ecosystem stability. All these questions are 
related, but the book counters expectations by 
becoming a survey of the great questions about 
the ecology of diversity instead of an argument 
in favour of the neutral theory. 

The Loom of Life is useful. Much of the 
public — and even some of the professional 
environmental movement — knows little about 
the rules ecologists have posited for creating 
and maintaining biodiversity. They might read 
this slender book for a bearing on how to tackle 
environmental problems. a 
Emma Marris is a correspondent for Nature 
based in Columbia, Missouri. 


Paris feels the polar chill 


Atmosphere... The Climate Revealed by Ice 
Museum of Arts and Crafts, Paris 
Until 30 April 2009 


With rising levels of emotion clouding the 
political debate about global climate change, 
a new exhibition at the Museum of Arts and 
Crafts (Musée des Arts et Métiers) in Paris 
provides a timely reminder that accurate data 
are the basis of scientific objectivity. Running 
until April next year, Atmosphere shows how 
the past 50 years of scientific observation in 
the Arctic and Antarctic have provided evi- 
dence of Earth’s vulnerability and changed 
how we view our planet. The exhibition, says 
geochemist Jean Jouzel, one of its two scien- 
tific commissioners, “provides a clarion call 
to use this carefully collected data to take 
informed action against climate change”. 
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The starting point for Atmosphere is the 
International Geophysical Year of 1957-58. 
Objects on show in the exhibition include 
scientific instruments used at the poles, 
newspaper reports, scientific papers and 
books, photographs and films. Charcot, the 
first French polar expedition base in Antarc- 
tica, is photographed almost submerged in 
snow and resembles a surfacing submarine, 
its tricolour flag taut in the Antarctic wind. 
Although the focus is on raising awareness 
of the French contribution to polar research, 
the work of other nations and international 
collaborative groups is also presented. Polar 
wildlife and landscape photographs, induding 
spectacular aurora borealis displays, document 
the fauna and natural beauty of the polar 
regions. The 130 exhibited objects range from 
an apparatus that simulates the aurora borealis 
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in the laboratory, to a ‘Weasel’ tracked vehicle 
used for transport across the snow and ice. 

Jouzel’s own research is included — his 
sampling to a depth of 3,000 metres, of polar 
ice that was laid down some 500,000 years 
ago, is crucial to our current understanding 
of how Earth’s atmosphere has changed dur- 
ing this time. By analysing the composition 
of tiny bubbles of air trapped in polar ice, he 
established a correlation between temperature 
and the concentration of greenhouse gases, 
specifically carbon dioxide and methane, 
during successive climatic cycles. By reveal- 
ing what occurred in the past, scientists can 
understand how best to prevent further 
damage to the planet. Several conferences 
will be held during the exhibition to alert the 
public to the need to change attitudes and 
behaviours that contribute to global warming. 
“But Sarah Palin is not welcome,” jokes Jouzel, 
referring to the climate-change scepticism of 
the Republican vice-presidential nominee in 
the US election. “She is not invited” a 
Colin Martin is a writer based in London, UK. 
e-mail: cmpubrel@aol.com 


See www.arts-et-metiers.net for more details. 
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A snapshot of hidden science history 


Brought to Light: Photography and the 
Invisible, 1840-1900 

San Francisco Museum of Modern Art, 
California 

Until 4 January 2009 


The use of photography in science during the 
nineteenth century is traced in a new exhibi- 
tion at the San Francisco Museum of Modern 
Art, running until 4 January 2009. Brought to 
Light showcases scientific photographs that 
reveal phenomena invisible to the unaided 
eye, from early X-rays of the body’s skeleton 
to fields of electricity. 

Scientists first considered photography as 
little more than a quicker way of recording the 
world than drawing, but soon realized that it 
could be used to gain access to aspects of the 
world that had previously been off limits. Early 
photographs of the Moon, the surface of the 
Sun, and of distant nebulae and their spectra 
changed astronomers’ understanding of our 
Universe. Popular photographs of microscopic 
objects, such as crystals and insects, revealed 
new complexity in the everyday world. Indeed, 
in 1877, astronomer Pierre Jules César Janssen 
declared that the photographic plate was the 
“true retina of the scientist”. 
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Hermann Schnauss's photo revealed how sparks 
of electricity emanate from a metal wire gauge. 


Photography evolved rapidly with the 
new sciences. An X-ray image that Wilhelm 
Conrad Réntgen took of his wife’s hand in 
1895, for instance, was widely reproduced 
by others who were fascinated by the abil- 
ity to see inside the human body. As exhibit 
curator Corey Keller writes in the catalogue, 
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“Today we are hardly shocked by the sight 
of our own bones, but we can glean from 
the remarkable public response just how 
extraordinary these pictures were when they 
first appeared, simultaneously suggesting an 
assault on privacy and conjuring the specter 
of death.” 

Keller argues that, by showing there was 
more going on in the world than was appar- 
ent to the unaided eye, scientific photographs 
inspired a fundamental shift away from faith 
in the human observer to faith in the authority 
of technology. And by the end of the nine- 
teenth century, the heyday of the ‘educated 
amateur’ had ended. Scientists were becoming 
the specialized practitioners they are today, 
using powerful, expensive and complicated ‘ 
tools to explore realities that remain invisible 
to most. 

By revealing the hidden forces at work in 
nature, the pictures collected in this exhibit 
have much in common with iconic modern 
images, such as the views of galaxies in the 
distant Universe captured by the Hubble Space 
Telescope. In that way, the exhibit does not just 
illuminate the past — i also gives a glimpse of 
the future. 7 
Erika Check Hayden is a correspondent for 
Nature based in San Francisco. 
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Economics needs ascientific revolution 


Financial engineers have put too much faith in untested axioms and faulty models, says Jean-Philippe 
Bouchaud. To prevent economic havoc, that needs tochange. = | =| 


Compared with physics, it seems fair to say 
that the quantitative success of the economic 
sciences has been disappointing. Rockets fly 
to the Moon; energy is extracted from minute 
changes of atomic mass. What is the flagship 
achievement of economics? Only its recurrent 
inability to predict and avert crises, including 
the current worldwide credit crunch. 

Why is this so? Of course, to paraphrase Isaac 
Newton, modelling the madness of people is 
more difficult than modelling the motion 
of planets. But statistical regularities should 
emerge in the behaviour of large populations, 
just as the law of ideal gases emerges from the 
chaotic motion of individual molecules. To 
me, the crucial difference between modelling 
in physics and in economics lies rather in how 
the fields treat the relative role of concepts, 
equations and empirical data. 

Classical economics is built on very strong 
assumptions that quickly become axioms: the 
rationality of economic agents (the premise 
that every economic agent, be that a person or 
a company, acts to maximize his profits), the 
‘invisible hand’ (that agents, in the pursuit of 
their own profit, are led to do what is best for 
society as a whole) and market efficiency (that 
market prices faithfully reflect all known infor- 
mation about assets), for example. An econo- 
mist once told me, to my bewilderment: “These 
concepts are so strong that 
they supersede any empiri- 
cal observation.” As econo- 
mist Robert Nelson argued 
in his book, Economics as 
Religion (Pennsylvania State Univ. Press, 2002), 
the marketplace has been deified. 

Physicists, on the other hand, have learned 
to be suspicious of axioms. If empirical obser- 
vation is incompatible with a model, the 
model must be trashed or amended, even if 
it is conceptually beautiful or mathematically 
convenient. So many accepted ideas have been 
proven wrong in the history of physics that 
physicists have grown to be critical and queasy 
about their own models. 

Unfortunately, such healthy scientific 
revolutions have not yet taken hold in econom- 
ics, where ideas have solidified into dogmas. 
These are perpetuated through the education 
system: students don’t question formulas they 
can use without thinking. Although numer- 
ous physicists have been recruited by financial 


"Classical economics has no 
framework through which to 
understand ‘wild’ markets.” 


institutions over the past few 
decades, they seem to have 
forgotten the methodology 
of the natural sciences as they 
absorbed and regurgitated the 
existing economic lore. 
The supposed omnis- 
cience and perfect 
efficacy of a free mar- 
ket stems from eco- 
nomic work done in 
the 1950s and 1960s, 
which with hind- 
sight looks more like 
propaganda against 
communism than 
plausible science. In 
reality, markets are 
not efficient, humans 
tend to be over-focused 
in the short-term and blind 
in the long-term, and errors get amplified, ulti- 
mately leading to collective irrationality, panic 
and crashes. Free markets are wild markets. 


Picture imperfect 

Reliance on models based on incorrect axioms 
has clear and large effects. The Black-Scholes 
model, for example, which was invented in 
1973 to price options, is still used extensively. 
But it assumes t hat the 
probability of extreme 
price changes is negligi- 
ble, when in reality, stock 
prices are much jerkier 
than this. Twenty years ago, unwarranted use of 
the model spiralled into the worldwide October 
1987 crash; the Dow Jones index dropped 23% 
in a single day, dwarfing recent market hiccups. 
Ironically, it was the very use of a crash-free 
model that helped to trigger a crash. 

This time, the problem lies, in part, in the 
development of structured financial products 
that packaged subprime risk into seemingly 
respectable high-yield investments. The mod- 
els used to price them were fundamentally 
flawed: they underestimated the probability 
that multiple borrowers would default on 
their loans simultaneously. These models again 
neglected the very possibility of a global crisis, 
even as they contributed to triggering one. 

Surprisingly, classical economics has no 
framework through which to understand 
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‘wild’ markets, even though 
their existence is so obvious to 
the layman. Physics, on the other 

hand, has developed several 
j models that explain how 
small perturbations can 

lead to wild effects. The 
theory of complexity 
shows that although 
a system may have an 
optimum state, it is 
sometimes so hard 
to identify that the 
system never settles 
there. This o pti- 
mum state is not 
only elusive, it is 
also hyper-fragile 
to small changes 
in the environment, 

and therefore often 

irrelevant to understanding what is going on. 

There are good reasons to believe that this 
paradigm should apply to economic systems in 
general and financial markets in particular. We 
need to break away from classical economics 
and develop completely different tools. Some 
behavioural economists and econo-physicists 
are attempting to do this now, in a patchy way, 
but their fringe endeavour is not taken seri- 
ously by mainstream economics. 

While work is done to enhance models, 
regulation also needs to improve. Innovations 
in financial products should be scrutinized, 
crash-tested against extreme scenarios outside 
the realm of current models and approved by 
independent agencies, just as we have done with 
other potentially lethal industries (chemical, 
pharmaceutical, aerospace, nuclear energy). 

Crucially, the mindset of those working in 
economics and financial engineering needs to 
change. Economics curricula need to include 
more natural science. The prerequisites for 
more stability in the long run are the develop- 
ment of a more pragmatic and realistic rep- 
resentation of what is going on in financial 
markets, and to focus on data, which should 
always supersede perfect equations and aes- 
thetic axioms. . 
Jean-Philippe Bouchaud is head of research 
of Capital Fund Management and a physics 
professor at Ecole Polytechnique in France. 
e-mail: jean-philippe.bouchaud@cea.fr 
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The sands of tsunami time 


Stein Bondevik 


The scale of the 2004 tsunami that devastated shores around the Indian Ocean has no precedent in written 
histories of the region. But evidence of similar events has been unearthed from the geological record. 


The huge earthquake and ensuing tsunami 
in the Indian Ocean on 26 December 2004 
killed more than 220,000 people in 11 coun- 
tries. Hundreds of years of accumulated stress 
in the Sunda Trench was released within a 
few minutes and drove the Indo- Australian 
tectonic plate an average of 13 metres beneath 
the Burma-Sunda plate’. 

Written history in Indonesia — the country 
that suffered the most — goes back 400 years 
without any mention of an event of similar 
severity~’. So how long did it take to accumu- 
late the energy stored along the fault that was 
so destructively released in 2004? hh this issue, 
Jankaew et al. (page 1228)* and Monecke et al. 
(page 1232)° present evidence of previous tsu- 
namis from Thailand and northern Sumatra, 
respectively, the results of months of hard field- 
work. Below sand left by the 2004 tsunami and 
accumulations of peat, they found sand sheets 
that resembled those deposited in 2004. Both 
studies show a sand layer dated to 600-700 
years ago (ad 1300-1400) t hat could be the 
most recent full-sized predecessor of the 2004 
tsunami. Such a long build-up of stress could 
explain the scale of that event (Fig. 1). 

Ancient tsunami deposits are only rarely 
preserved and can be hard to find. A prerequisite 
is that there is some kind of a sediment trap in 
which the sand left by the tsunami is protected 
from wind, burrowing animals, running water 
and human activity. The authors of both papers 
found low areas between beach ridges, called 
swales, to be such sediment traps. Swales are wet 
and marshy, are occasionally submerged, and 
hold deposits of peat and other organic matter. 
The difference between the dark organic 
peat/soil and the light-coloured tsunami 
sand helps to trace the sand sheets. Jankaew 
et al.* observed that in wet swales the 2004 
tsunami sand already (as of March 2007) has 
a protective cover of as much as 5 centimetres 
of organic matter. 

The buried sand sheets in the swales are 
similar to the sand washed up by the 2004 
tsunami. But how do the authors know that the 
sand sheets were not deposited by other agents 
such as wind, rivers or storms? The reason is 
the geographical setting of the sites sampled 
by both groups*”. The sites are not exposed to 


Figure 1| The town of Lhoknga, on the northwest coast of Sumatra, before (left) and after the tsunami 
of 26 December 2004. These satellite photographs show the force and extent of the tsunami. The only 
building to survive was the mosque, seen as the prominent white structure on the right of both images. 
From sand layers in Thailand’ and northern Sumatra’, the best estimate is that the last tsunami of a 
similar scale occurred 600-700 years ago. 


intense storms; rivers do not reach them; and 
there are no sand dunes as evidence of wind 
activity. Careful mapping and examination of 
internal structures show the sand to have been 
deposited from the sea side. Incorporation of 
plant debris and soil in the sand are evidence of 
strong erosion and turbulent water flow across 
vegetated land. 

Tsunami deposits can be difficult to date, and 
further work needs to be done to obtain more 
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accurate and precise ages. Both teams used the 
radiocarbon method. Many of the ages they 
obtained are from samples of organic mat- 
ter collected below the sand sheets. For these 
samples, there is the potential error that 
strong currents in the tsunami flow stripped 
away years of organic deposition, just before 
the sand settled®. The measured ages would 
then be older than the true age of the tsunami 
concerned. Interestingly, Jankaew et al.* found 
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leaf fragments in the sand that were several 
thousand years older than the age obtained 
from bark fragments in the soil just below the 
sand layer. The age of the leaf fragments means 
that older material must have become mixed 
into the tsunami deposits, and shows the 
importance of selecting the right material for 
radiocarbon dating of such deposits. 

Three older tsunami events were identified, 
but only the youngest of them correlates 
between the two regions. This sand sheet 
was dated to ad 1290-1400 in Sumatra’ and 
ad 1300-1450 in Thailand’, and could be the 
last big forerunner of the 2004 tsunami. The 
oldest sand sheet in Simatra, dated to ad 780- 
900, seems to lack a corresponding sheet with 
the same age in Thailand. However, Jankaew 
et al.* show, in one of their supplementary fig- 
ures (Fig. S2c), indistinct and patchy remains 
of a sand sheet that they did not radiocarbon 
date, but that must be older than ad 1450 and 
younger than 395 bc. This vague sand sheet 
could be the same as the oldest sand sheet in 
Sumatra and should be subjected to dating. 
The third and oldest sand sheet in Thailand’ is 
about 2,200 years old, and older than the depos- 
its studied by Monecke et al.* in Sumatra. 

If events such as the 2004 tsunami happen 
only every 600 years or so, there are considera- 
ble implications for urban and coastal planning 
in the region. The inhabitants might consider 
the benefits of living close to the sea as greater 
than the risks of a catastrophic tsunami that 
will not return for many generations. Also, it 
does not make sense to invest in and maintain a 
warning system for devastating tsunamis if they 
recur so infrequently. But smaller tsunamis may 
well happen more often’, and a warning sys- 
tem should save lives during such events. More 
geological work is also needed to measure and 
verify the 600-year-long recurrence intervals 
for exceptional tsunamis such as that of 2004. 

The findings of Jankaew et al. and Monecke 
et al.” are the first steps towards compiling a 
record of ancient tsunamis in the Indian Ocean, 
and they should inspire and encourage geol- 
ogists to continue the work. There is plenty to 
do. To conclude that these past tsunamis were 
as big as the 2004 event, we need to see the same 
layers on other shores around the Indian Ocean. 
And the radiocarbon chronology of tsunamis 
has to be verified at different locations. a 
Stein Bondevik is in the Department of Geology, 
University of Tromsg, Norway, and the Faculty 
of Engineering and Science, Sogn og Fjordane 
University College, NO-6851 Sogndal, Norway. 
e-mail: stein.bondevik@hisf.no 
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Small regulatory RNAs pitch in 


Ulrich Technau 


How did organismal complexity evolve at a cellular level, and how does a 
genome encode it? The answer might lie in differences, not in the number of 
genes an organism has, but rather in the regulation of gene expression. 


It is co mmonly believed t hat complex 
organisms arose from simple ones. Yet analyses 
of genomes and of their transcribed genes 
in various organisms reveal'” that, as far as 
protein-coding genes are concerned, the rep- 
ertoire of a sea anemone — a rather simple, 
evolutionarily basal animal — is almost as 
complex as that of a human. Grimson et al.’ 
might have the answer to this paradox. They 
report in this issue (page 1193) that differences 
in complexity between some evolutionarily 
ancient, morphologically simple animals and 
the highly sophisticated primates could be 
explained, at least in part, by the regulation of 
gene expression by small RNAs. 

One of the three main classes of noncoding 
regulatory RNA expressed by bilaterians — 
organisms that have a bilateral symmetry — is 
the microRNAs (miRNAs). These sequences 
exist as RNA duplexes (roughly 21-24 nucleo- 
tides long), one strand of which usually 


Number of 
miRNAs 


binds to its target messenger RNA, reducing 
the protein output from that message. Each 
miRNA has tens to hundreds of target 
mRNAs, adding a notable level of complex- 
ity to the regulation of gene transcription*”. 
In humans, hundreds of miRNAs have been 
identified, yet the search for related sequences 
in evolutionarily ancient and morphologically 
simple non-bilaterian animals (such as sponges, 
comb jellies and cnidarians) has led ‘ only two 
to three putative miRNA sequences, found in 
the sea anemone Nematostella vectensis**. 

To take a doser look at the problem, Grimson 
et al.’ used deep-sequencing techniques 
(454 Life Sciences and Illumina sequencing 
machines) to analyse what was presum- 
ably the nearly full complement of small 
noncoding RNAs in a range of morphologi- 
cally simple organisms. These included a uni- 
cellular choanoflagellate (Monosiga brevicollis) 
and several representatives of non-bilaterian 
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Figure 1| Animal miRNAs and morphological complexity. Grimson et al.’ (data along red lines) reveal 
the evolutionary origin of animal miRNAs by examining organisms at the base of the animal tree. 
Combining their data with previous work, three different measures of complexity become apparent: 
the number of protein-coding genes, total number of neurons and number of miRNAs. There is 
relatively little correlation between morphological complexity and the number and diversity of 
protein-coding genes. However, miRNA number correlates well with the organism's total number of 
neurons. Indeed, a large proportion of vertebrate miRNAs are expressed in the nervous system. These 
data also show the dynamic nature of the miRNA complement in each lineage, particularly visible in 
rapidly evolving species (Oikopleura and fruitfly). 
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multicellular phyla: a cnidarian (N. vectensis), 
a sponge (Amphimedon queenslandica) and a 
placozoan (Trichoplax adhaerans). The present 
study’ also demonstrates the greater power of 
unbiased deep sequencing at near-saturation 
levels compared with approaches that rely on 
sequence similarities alone. 

In the sponge, the authors find eight differ- 
ent miRNAs. Moreover, they report that the sea 
anemone has, in fad, not just three but at least 
40 different miRNAs. Although, compared 
with the previous estimates *, 40 miRNAs is 
clearly more significant, when compared with 
the 147 miRNAs identified in the fruitfly Dro- 
sophila and the 677 human miRNAs, this is not 
a high number. A pattern emerges, however: 
miRNA diversity in these three organisms 
seems to correlate with an increase in their 
relative morphological complexity (Fig. 1), 
even after accounting for some inflation in 
the number of human miRNAs owing to 
possible mis-annotation of certain sequences 
as miRNAs. 

All but one of the sea anemone miRNAs 
seem to lack any resemblance to bilaterian 
miRNAs, with the one exception being almost 
identical in sequence to the bilaterian miRNA 
known as miR-100. The similarity of miR-100 
sequences among sea anemone, fly and ver- 
tebrate pushes the origin of animal miRNAs 
back to at least the last common ancestor of 
sea anemones and vertebrates, some 600 mil- 
lion years ago. The eight sponge miRNAs do 
not match those of either the sea anemone 
or bilaterians. But, as the miRNA biosynthe- 
sis machinery is clearly fully conserved in 
sponges’, it seems plausible that the origin of 
animal miRNA dates back to the last ancestors 
of all animals — the Urmetazoa. 

Grimson et al. could not convincingly detect 
miRNAs in either the placozoan (Tricho flax 
is a very simple organism, more like a multi- 
cellular amoeba than an animal) or the choano- 
flagellate, putatively the closest unicellular 
sister group of multicellular animals. Both of 
these species also seem to lack Pasha’, a protein 
crucial for the biosynthesis of miRNAs. Unlike 
other proteins of the miRNA biosynthesis 
machinery, Pasha’s only known function is in 
miRNA processing. Loss of the gene encod- 
ing Pasha should therefore directly and only 
affect miRNA generation. Consequently, as the 
latest, but not uncontested, molecular phylo- 
geny’ places Placozoa between the sponges and 
Cnidaria, the absence of miRNAs in Placozoa 
is probably due to secondary loss, whereby a 
single gene loss may have led to the complete 
loss of miRNAs in these organisms. Moreover, 
assuming a role for miRNAs in supporting 
higher morphological complexity, their loss, 
together with other gene losses’, may have led 
to the secondary morphological simplification 
of the placozoan body plan. 

Besides miRNAs, bilaterians possess another 
type of (slightly longer) small RNA called 
piRNA. This comes in two forms, both typi- 
cally expressed in the germ line — the cells 


that produce gametes. Grimson et al. find 
high numbers of both piRNA subclasses in 
sponges and sea anemones. Strictly speak- 
ing, members of these two phyla do not have a 
classical germ line. Instead, multipotent 
stem cells, which can differentiate into diverse 
cell types — both somatic and germline — 
ensure lifelong production of gametes. Future 
work will undoubtedly aim to elucidate the 
function of piRNAs during stem-cell differ- 
entiation and gamete formation in such basal 
metazoans. 

The finding that two basal metazoans pos- 
sess relatively high numbers of miRNAs and 
piRNAs that have almost no related sequences 
in Bilateria suggests that cnidarian and sponge 
miRNAs have largely diversified independently 
of bilaterian miRNAs. But is this a rule or an 
exception in the evolution of these organisms? 
One view is that, once fixed in the population, 
miRNAs are seldom lost. Consequently, there 
would be a constant increase in diversity, cor- 
relating with an increase in morphological 
complexity’. Indeed, Grimson and colleagues’ 
data confirm a correlation between miRNA 
diversity and morphological complexity as 
measured, for instance, by the total number 
of neurons in an organism (Fig. 1). But work 
on rapidly evolving species, such as several 
species of Drosophila and the planktonic 
urochordate Oikopleura, revealed’”" both 
extensive gain and loss of many miRNAs over 
relatively short evolutionary timescales. This 
indicates the dynamic evolutionary nature of 
many miRNAs, and explains the diversity of 
the miRNA complement in different lineages. 

What could simple organisms such as the sea 
anemone and sponges use their ‘respectable’ 


number of miRNAs for? With no functional 
data available, one can only speculate. Many 
cnidarians have a rather complex, multi- 
stage life cycle. So, as in worms, flies and 
mammals, miRNAs could be involved in 
the timing of these different developmental 
stages. Another striking feature of cnidar- 
ians is their enormous capacity to regener- 
ate. This property relies on the continuous 
differentiation of stem cells, possibly requiring 
miRNAs as part of a well-buffered molecular 
patterning system to ensure homeostasis. As 
for the role of piRNAs, in Bilateria they are 
usually involved in silencing mobile genetic 
elements called transposons. Intriguingly, 
compared with other animal genomes, that of 
the sea anemone also contains a remarkable 
diversity of transposable elements. Control 
of genome integrity by piRNAs during gam- 
ete formation might therefore be crucial for 
the survival of these basal metazoans over 
evolutionary timescales. a 
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ANALYTICAL CHEMISTRY 


Plasma bubbles detect elements 


José A. C. Broekaert 


Bunsen didn't just invent the burner — he also worked on spectral analysis, 
in which compounds are heated in flames to produce distinctive light 
emissions. The latest device replaces flames with microplasmas. 


Budding chemists have long enjoyed high- 
school experiments in which metal salts 
are heated in a flame to produce distinctive 
colours: lilac for potassium, intense yellow for 
sodium, brick red for calcium, and so on. But 
metal salts also produce distinctive colours 
when placed in a plasma. Staack et al.' now 
report in Angewandte Chemie that plasma 
can be formed at the tip of a microelectrode 
in solution, where it excites electrons in the 
solute. Upon relaxation of the electrons to the 
ground state, a distinctive spectrum of light is 
produced (the optical atomic spectrum), which 
can be used to detect and analyse the elements 
contained in the solution. This discovery could 
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form the basis of portable analytical devices 
and environmental sensors. 

Optical atomic spectroscopy has an illustri- 
ous history. In the nineteenth century, Robert 
Bunsen and Gustav Kirchhoff nebulized solu- 
tions in flames, dispersed the emitted light 
with a prism, and used the intensities of lines 
in the resulting atomic spectra to determine 
the concentrations of elements in the solutes’. 
Indeed, it was this method that led to their dis- 
coveries of caesium and rubidium. Then, in the 
twentieth century, many plasma radiation 
sources were introduced for spectrochemical 
analysis in the laboratory. Today, plasma atomic 
spectrometry is used to analyse samples from 
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Figure 1| Small balls of fire. Staack et al.' 

report a technique for detecting the elements 

in a solution. a, An electrode with a sharp tip 
(red) is immersed in a solution with a counter- 
electrode. High voltages are generated, which 
discharge to produce a small plasma bubble at 
the tip. The discharge can be observed with a 
microscope, and the spectrum of light produced 
from the plasma can be recorded using an optical 
fibre. The spectra produced are distinctive of 
the elements present in the solution. (Graphic 
modified from ref. 1.) b, This corona formed 

on an electrode tip immersed in blood plasma. 
(Image from ref. 1.) 


such diverse origins as water, soil and plants, 
and several sampling devices allow the analysis 
of not only solutions, but also solid samples. 
The plasmas used can be generated by various 
methods, induding electrical discharges (such 
as arcs and sparks) and lasers’. These methods 
are robust, and the resulting plasmas can be 
used for the accurate, precise analysis of ele- 
ments in a sample, including those present in 
trace quantities. But there’s a drawback — the 
instruments are bulky and expensive. 

There is therefore a pressing need for smaller 
instruments that are cheap to make and run, 
and that can be easily transported for field 
studies. This has spawned many attempts to 
make minute plasma sources for use with mod- 
ern, miniaturized optical spectrometers. Some 
success has been achieved using microwaves 
to induce plasmas in gas-containing channels 
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built into quartz wafers*. These inexpensive 
devices proved to be useful for analytical tasks 
— for example, they have been used success- 
fully to determine the concentrations of toxic 
elements, such as mercury, in water®. But for 
this method to work, the sample has to be con- 
verted into a gas using volatilization reactions 
before analysis. Staack et al.’ demonstrate that 
a minuscule plasma discharge can be produced 
directly in solution, and still be used to obtain 
atomic emission spectra from the solute. 

In Staack and colleagues’ device, high volt- 
ages are produced at a sharp tip (a metal elec- 
trode, a carbon wire or a quartz pipette covered 
with gold) when it is immersed in a solution 
together with a counter-electrode (Fig. 1a). 
The high voltage generates bubbles, formed 
from a combination of solute vapour, gases 
dissolved in the solute and solute-decom- 
position products. But the bubbles also take 
up elements contained in the solution. 
The bubbles are released at the tip, and go 
on to form a plasma discharge (Fig. 1b). The 
discharge measures about 10 micrometres 
across, and the light emitted from it con- 
tains the atomic spectral lines of the ele- 
ments present in the solution. The intensities 
of the lines vary widely depending on such 
factors as the diameter and nature of the 
electrode; the amplitude, duration and 
frequencies of the voltage pulses; and the 
types of element present in the solution and 
their concentrations. Rather like a flame test, 
solutions of sodium chloride, tap water and 
blood therefore each give plasma discharges 
of differing colour and with different spectra. 

Of course, it is still early days for this tech- 
nique. The plasma discharges produced in 
Staack and colleagues’ system have not been 
characterized to determine their tempera- 
tures or electron densities. Furthermore, the 
exact processes through which elements in the 


solute are excited are not known. It will also be 
necessary to investigate the wavelength range 
over which radiation can be sampled spectro- 
metrically — which, in turn, determines the 
elements that can be detected. Furthermore, 
the dependence of the excitation efficiency on 
the dimensions of the microplasmas must be 
quantified. Finally, all the working parameters 
of the technique must be precisely determined, 
so that any future devices can be tailored to 
specific analytical tasks. Nevertheless, the 
method has great promise: the authors show’ 
that the signals produced in their experiments 
are proportional to the concentrations of ele- 
ments in solution, and that quantities of solutes 
as small as a £w micrograms can be detected 
per millilitre of solution. 

The big advantage of Staack and colleagues’ 
system’ is likely to be its portability. The low 
cost of the instrument and its operation should 
allow it to be used in the field, and devices 
could even be permanently installed within the 
flows of the liquids to be analysed — no Imger 
would samples have to be transported to the 
laboratory for analysis. Such devices therefore 
have potential for environmental applications, 
for example to detect increased concentrations 
of elements in rivers, or in industrial waste 
streams. a 
José A. C. Broekaert is in the Department 
of Chemistry, University of Hamburg, 
Martin-Luther-King-Platz 6, D-20146 Hamburg, 
Germany. 
e-mail: jose.broekaert@chemie.uni-hamburg.de 


1. Staack, D. et al. Angew. Chem. Int. Edn 47, 8020-8024 
(2008). 

2. Kirchhoff, G. R. & Bunsen, R. Phil. Mag. 20, 89-98 (1860). 

3. Broekaert, J. A.C. Analytical Atomic Spectrometry with 
Flames and Plasmas (Wiley-VCH, 2005). 

4. Franzke, J. & Miclea, M. Appl. Spectrosc. 60, 80A-90A 
(2006). 

5. Engel, U., Bilgic, A. M., Haase, O., Voges, E. & Broekaert, 
J.A.C. Anal. Chem. 72, 193-197 (2000). 


CELL BIOLOGY 


A Listeria escape trick 


Grace Y. Lam and John H. Brumell 


Pathogens have many ways of subverting their hosts’ molecular machinery. 
Astriking example of sucha ploy comes to light from investigations of the 
species of bacterium that causes listeriosis. 


A recent outbreak of Listeria monocytogenes 
infection in Canada’ has to date claimed 
20 lives out of 53 cases and is a stark reminder 
that these bacteria pose a serious threat to public 
health. Ingestion of contaminated food can 
have especially serious consequences for those 
at high risk — pregnant women, newborns, 
the elderly, and individuals with compromised 
immune systems. 

A better understanding of L. monocytogenes 
will depend on insight into the interaction 


© 2008 Macmillan Publishers Limited. All rights reserved 


between the bacterium and its host at the 
cellular level. Typically, bacteria are engulfed by 
macrophages, versatile agents of the immune 
system. They are then contained in a cellu- 
lar compartment called a phagosome, which 
gradually becomes acidified and fuses with a 
compartment containing digestive enzymes, 
the lysosome, destroying the invader in the 
resulting phagolysosome. However, L. mono- 
cytogenes can avoid destruction by escaping 
through the phagosomal membrane into the 
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cell cytosol, where it replicates and goes on 
to infect neighbouring cells. The key player in 
the escape process is the bacterial virulence 
factor, listeriolysin O (LLO). On page 1244 
of this issue, Singh et al.” reveal that L. mono- 
cytogenes takes advantage of a host factor to 
promote LLO activity during infection. 

Listeriolysin O belongs to a family of chol- 
esterol-dependent cytolysins that are character- 
ized by their ability to oligomerize and form 
pores in host membranes. It is thought 
to mediate L. monocytogenes escape 
from phagosomes in two steps. First, in 
response to acidification of the phago- 
some, LLO undergoes structural 
reconfigurations that render the 
protein stable and active’. It then 
forms small pores in the phago- 
some, blocking acidification of this 
compartment and thereby prevent- 
ing its fusion with lysosomes’. This 
provides a ‘window of opportunity’ 
for the second step of bacterial escape, 
which involves the combined actions of 
LLO and two phospholipase enzymes’. 
However, LLO also has the potential to 
kill the host cell through unchecked pore 
formation in the cell’s membrane. That is an 
undesired outcome for the bacterium, which 
depends on the cell for its continued existence. 
Thus, L. monocytogenes has evolved many fail- 
safe mechanisms to limit LLO activity in the 
cytosol (see ref. 3 for a review). 

The activity of LLO is also modulated by the 
phagosomal environment (Fig. 1). NADPH 
oxidase and nitrogen oxide synthase, two 
phagosomal enzyme complexes that pro- 
duce antimicrobial agents — reactive oxy- 
gen and nitrogen species — inhibit escape of 
L. monocytogenes*. This suggests that the redox 
(reduction-oxidation) environment in the 
phagosome can control LLO activity. A single, 
sulphur-containing cysteine amino-acid resi- 
due in the protein may be the target of oxida- 
tion. Indeed, it has been known for some time 
that reducing agents are needed for optimal 
LLO activity in vitro’®. 

How, then, do the bacteria overcome 
LLO oxidation to escape into the cytosol? 
Singh et al.” suggest that the answer lies in a 
host protein, y-interferon-inducible lyso- 
somal thiol reductase (GILT). This enzyme 
normally ensures the proper degradation of 
cysteine-containing peptides as part of another 
macrophage function — presentation of 
foreign antigen at the macrophage surface 
to stimulate antibody production. It seems, 
however, that L. monocytogenes takes advan- 
tage of GILT to promote LLO activity in the 
phagosome. 

Singh et al. first observed that mice lacking 
GILT are partially protected from L. mono- 
cytogenes infection, in that they exhibit reduced 
bacterial replication in the spleen and liver. 
Bacterial escape from the phagosome was 
impaired in the macrophages of these mice, 
a phenomenon that was not due to differing 
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rates of invasion or phagosome acidification. 
When GILT was expressed in GILT-deficient 
cells, bacteria could once again escape effi- 
ciently. Furthermore, the in vitro introduc- 
tion of mutant GILT lacking thiol reductase 
activity into GILT-deficient macrophages 
failed to restore the defect in bacterial escape. 
Conversely, using an LLO mutant in which the 
protein’s single cysteine was replaced with 
alanine and which remains reduced in the 
phagosome, the authors discovered that this 
mutant confers the same bacterial escape abil- 
ity in GIIT-deficient macrophages as in narmal 
cells. Together, these observations show that 
L. monocytogenes escape is impaired if LLO is 
not reduced by GILT. 

Next, Singh et al. examined the mechanism 
of GILT action. They found that LLO reduc- 
tion by GILT involves sulphur groups, as the 
addition of a thiol reactive reagent inhibited 
LLO activity in the presence of GILT. Further, 
they showed that the GILT-LLO interaction 
involves the classic two-step thiol reductase 
mechanism, by engineering a GILT mutant 
that interrupts the mechanism and results in 
an accumulation of the intermediate enzyme- 
substrate complex. When L. monocytogenes 
invaded macrophages bearing this GILT 
mutant, LLO bound to the mutant GILT. The 
data are consistent with the notion that GILT is 
delivered to phagosomes, where it can reduce 
LLO and promote its activity, allowing bacte- 
rial escape. However, because GILT was not 
observed in phagosomes containing the bac- 
teria, it remains possible that GILT affects LLO 
(or other host or bacterial factors) elsewhere in 
the cell to allow bacterial escape. 

It was already known that host factors affect 
the process by which L. monocytogenes escapes 
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Figure 1 | Regulation of the Listeria 
monocytogenes virulence factor listeriolysin O 
(LLO) in the phagosome of a macrophage. 1, In 
the normal sequence, the phagosome gradually 
becomes acidified through the action of proton 
pumps and then fuses with the lysosome 

(not shown), leading to bacterial degradation. 
But this pathway is blocked by L. monocytogenes 
through the function of LLO. As the pH drops, 
LLO undergoes structural reconfiguration, 
rendering it stable and active. It then creates 
pores in the phagosomal membrane, blocking 
acidification and thereby preventing lysosome 
fusion. 2, Reactive oxygen and nitrogen species 
are normally produced in the phagosome by 
NADPH oxidase and nitrogen oxide synthase 
enzymes, respectively. This creates an oxidizing 
environment in the phagosome that inhibits 
LLO function. 3, Singh et al.’ propose that 

L. monocytogenes uses host y-interferon- 
inducible lysosomal thiol reductase (GILT) to 
reduce LLO, promoting LLO’s ability to form 
pores in the phagosomal membrane and allow 
subsequent bacterial escape into the cytosol. 


from phagosomes’. Singh and colleagues’ 
contributions’ are to reveal that GILT regulates 
LLO activity, and to highlight the importance 
of the redox environment in bacterial escape. 
Why the bacteria require multiple levels of reg- 
ulation on LLO is not clear. It is possible that 
exquisite control over the time and place that 
LLO is active is crucial in allowing bacterial 
escape from a specific stage in the phagolyso- 
somal pathway. Another question raised by 
this work is whether other cytosol-adapted 
pathogens, such as Shigella flexneri and Rick- 
ettsia rickettsii, take advantage of host GILT in 
the same manner as L. monocytogenes. If so, 
GILT may prove to be an attractive target for 
therapeutic intervention. a 
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CONDENSED-MATTER PHYSICS 


Borderline metals 


Christian Pfleiderer 


The standard model of metals is found to fail in several cases. The long- 
sought-after marginal state in which such a breakdown occurs has been 
identified in a metal on the border of ferromagnetism. 


In recent years a large number of metals have 
been discovered with properties that seem to 
be inconsistent with the standard model of 
metals — the Fermi-liquid theory. For lack of 
a better understanding, these materials have 
become known as non-Fermi-liquid metals. 
But the difference between a Fermi liquid 
and a non-Fermi liquid is like the difference 
between a banana and a non-banana. On page 
1220 of this issue, Smith et al.! report the tem- 
perature dependence of the electrical resistiv- 
ity of a non-Fermi-liquid state in the 
weakly ferromagnetic metal ZrZn, as a 
signature of the long-sought-after mar- 
ginal Fermi-liquid state — a prediction 
that historically inspired the search for 
quantum criticality, a breakdown of 
Fermi-liquid theory and magnetically 
mediated superconductivity. 

To appreciate the results of Smith 
et al. it is helpful to recall what makes 
Fermi-liquid theory so special. The 
electrical conductivity in metals is due 
to the motion of the conduction elec- 
trons in response to an applied electric 
field. But for the charge-carrier density 
of around 10” per cubic metre that 
exists in typical metals, the bare Cou- 
lomb repulsion between the conduc- 
tion electrons exceeds by many orders 
of magnitude the strength of the elec- 
tric field that causes the electric cur- 
rent. In other words, the movements 
of the conduction electrons in a weak 
electric field are anything but inde- 
pendent of one another. Fermi-liquid 
theory assumes that, despite these very 
strong interactions, it is still possible 
to describe the many-body excitations 
of the system of conduction electrons 
as particle-like entities (quasiparticles) 
that are akin to conduction electrons 
‘dressed’ by clouds of surrounding 
electrons (Fig. 1). 

In addition to its application to 
conduction electrons in metals, Fermi- 


Fermi liquids arise in systems in which 
particle-like excitations exist that inherit 
their statistical properties from the quantum- 
mechanical characteristics of their microscopic 
constituents. In metals, these constituents are 
the conduction electrons, which are fermions. 
Yet the description of the conduction electrons 
in terms of quasiparticles — a key property of 
the Fermi-liquid state — is possible only if 
the quasiparticles live long enough to mani- 
fest their particle-like ‘personality. In pure 


Figure 1| Almost particles. a, In a normal Fermi liquid, each 
constituent particle (dots) is surrounded by a polarization 


compounds this implies that the lifetime of a 
quasiparticle is ultimately limited by scattering 
with other quasiparticles. The lower the tem- 
perature of the system, the fewer quasiparticles 
are excited, the lower the scattering rate, and 
the longer they live. At the limit at which the 
temperature falls towards absolute zero, their 
lifetime becomes infinite. 

The Fermi-liquid state breaks down if the 
scattering rate between the quasiparticles grows 
without bound as the temperature decreases. 
In the simplest hypothesis, two sources could 
give rise to such a singular scattering rate: 
transverse electric currents or spin-spin inter- 
actions, neither of which can be screened by the 
conduction electrons. The former are impor- 
tant only in the microkelvin range’, but spin- 
spin interactions are important even at high 
temperatures. 

The marginal state identified by Smith et al.’ 
is the result of unscreened spin-spin interac- 
tions at the border of ferromagnetism. These 
interactions tie into a controversy 
that began in the 1920s when it was 
first noticed that the small, ordered 
magnetic moment of nickel could not 
be attributed to atomically localized 
moments only. Just like nickel, many 
magnetic metals exhibit a peculiar 
mixture of localized and itinerant mag- 
netic moments. As a major step towards 
resolving this problem, it was recog- 
nized in the 1960s that the dressing 
clouds of quasiparticles could be para- 
magnons: over-damped spin waves’. 

But it was immediately appreciated 
that these dressing clouds would imply 
a breakdown of the Fermi-liquid model 
at a zero temperature ferromagnetic 
critical point (T,=0) — a quantum 
critical point*®. This possibility inspired 
a research programme into the nature 
of quasiparticle properties in strongly 
correlated metals® and ordering phe- 
nomena driven by these interactions, 
such as magnetically mediated super- 
conductivity”®. The result was a quan- 
titative, phenomenological model of 
the effects of spin fluctuations, which 
explained many of the contradictory 
properties of weakly ferromagnetic 
metals’. However, spin-fluctuation 
theory assumes that linear-response 
theory can be used self-consistently to 
describe strong, nonlinear interactions 
even at a quantum critical point*”. 

But perhaps most im portantly, 


liquid theory also applies to all sorts 
of interacting many-body systems of 
particles known as fermions; only one 
such particle can occupy a quantum 
state at a given time. Examples include 
helium-3, *He, for which it was pro- 
posed originally, and nuclear matter in 
atoms and neutron stars. This makes 
Fermi-liquid theory and its limitations 
an area of wide-ranging interest. 


cloud of particles (blue shading) that is dragged along by its 
motion (arrow). The constituent particle and ‘dressing cloud; 
which together form what is known as a quasiparticle, act like 
weakly interacting particles. The blue and red dots illustrate the 
electrons’ differing spin orientations in a nearly ferromagnetic 
metal: for simplicity, spin-up is red and spin-down is blue. b, In 
the marginal state, the dressing cloud depends strongly on the 
temperature, and is so extended that a description of the system 
in terms of quasiparticles is almost invalid. Smith et al.' report 
the identification of a marginal Fermi liquid in the weakly 
ferromagnetic metal ZrZn,,. 


the experimental evidence for the 
expected breakdown of Fermi-liquid 
theory at a ferromagnetic quantum 
critical point was missing. This theo- 
retical hypothesis has prompted sev- 
eral experiments" aimed at finding the 
marginal state, but the interpretation 
of the results is rather controversial. 
Smith and colleagues have now identi- 
fied the marginal Fermi-liquid state in a 
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weak ferromagnet. The marginal state in these 
materials is reminiscent of that proposed in the 
context of high-T, superconductor cuprates, 
but with a subtle difference: in contrast to the 
cuprates, the lifetime of the quasiparticles in 
ferromagnets involves non-local corrections. 
Experimentally, the marginal Fermi-liquid 
state can be identified by looking at the life- 
time of the quasiparticles, which is tradition- 
ally inferred from the temperature dependence 
of the electrical resistivity only. For a Fermi 
liquid, the prediction is that the electrical resis- 
tivity should scale with temperature as T’; for 
a marginal Fermi liquid at the border of three- 
dimensional ferromagnetism, it should scale as 
T°?. At first sight, the variation of the electri- 
cal resistivity with temperature in the material 
studied by Smith et al., the weakly ferromag- 
netic metal ZrZn,, seemed to be inconsistent 
with Fermi-liquid theory and the marginal 
Fermi liquid. But the temperature dependence 
of the resistivity is an extremely subtle property 
as it also depends on the angular distribution of 
the scattering processes. Smith and colleagues 
therefore used a simple trick to determine the 
quasiparticle lifetime more accurately. They 
compared the temperature dependence of the 
electrical resistivity with that of the thermal 


conductivity for ultra-pure specimens, and 
found that the quasiparticle lifetime was char- 
acteristic of the marginal Fermi-liquid state; in 
the marginal state, the quasiparticles live just 
long enough to be recognized as particles but 
no longer; that is, the marginal state is almost 
not normal (Fig. 1). 

Finally, perhaps the most peculiar aspect 
of Smith and colleagues’ work is the anoma- 
lous temperature dependence of the resis- 
tivity under pressures above p, (the critical 
pressure at which ferromagnetism is entirely 
suppressed). The authors explain this anoma- 
lous behaviour in terms of intrinsic magnetic 
heterogeneities that may arise in the first-order 
magnetic transition at p, (ref. 11). Yet below 
Pc the marginal Fermi-liquid behaviour does 
not seem to be affected by a small magnetic 
moment, which above p- suddenly affects 
the resistivity strongly. This asymmetry goes 
along with the authors’ surprising remark that 
the quantum critical point in Pd,_.Ni, —a 
highly heterogeneous system that develops 
giant magnetic moments at the critical Ni 
concentration x, (ref. 12) of a paramagnetic 
to ferromagnetic quantum critical point — 
also displays the same marginal Fermi-liquid 
behaviour of a pure material. 


Keeping in mind that the entire analysis of 
the marginal state is based on the assumption 
that the response of the spin dynamics can be 
described by linear-response theory, one may 
speculate on whether the marginal state needs 
a little help to show its ‘face’. This help may 
be a small ferromagnetic moment or a small 
amount of structural disorder. Without it, the 
normal metallic state may be much more exotic 
than we think. a 
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STRUCTURAL BIOLOGY 


Serpins’ mystery solved 


James C. Whisstock and Stephen P. Bottomley 


Polymers of misfolded proteins underlie many diseases, including major 
neurodegenerative disorders. Structural data on how such aggregates of 
serpin proteins form answer several outstanding questions. 


Ever since Alois Alzheimer and his colleagues 
first described abnormal aggregates of the 
amyloid protein in the brain of a patient who 
died of a form of dementia now known as 
Alzheimer’s disease, biochemists have been 
fascinated by the molecular events associated 
with such protein misfolding. They hope that 
understanding the molecular basis of pro- 
tein misfolding will aid the development of 
drugs against not only Alzheimer’s disease, 
but also a range of disorders caused by mis- 
folded proteins. These are known collectively 
as conformational diseases and include prion 
diseases, Parkinson’s disease, Huntington's 
disease and several disorders known as ser- 
pinopathies’. Whereas Alzheimer’s disease 
and certain related disorders are caused by 
misfolded amyloid, serpinopathies — which 
include the respiratory disease emphysema, 
early-onset dementia and liver cirrhosis — are 
caused by polymerization of serpin proteins. 
On page 1255 of this issue, Yamasaki et al.” 
show how serpins misfold into serpinopathy- 
causing polymers. 

The serpin story started to take shape 25 years 


ago with detailed molecular and structural 
studies of three serpins’: a,-antitrypsin, a pro- 
tein that reduces the inflammatory activity of 
the protease enzyme elastase; antithrombin, a 
protease inhibitor that prevents blood coagu- 
lation; and ovalbumin, from the chicken egg 
white. More than 85 serpin structures are now 
available and, cumulatively, these indicate that 
inhibitory serpins undergo a dramatic confor- 
mational change on contact with target pro- 
teases". Notably, protease binding results in the 
cleavage of an exposed amino-acid sequence 
in the serpin molecule called the reactive-cen- 
tre loop (RCL), allowing it to become an addi- 
tional B-strand (s4A) in the central B-sheet 
— the A-sheet — of the same serpin. Certain 
serpins (for example, plasminogen activator 
inhibitor-1) can spontaneously undergo a 
similar conformational rearrangement with- 
out RCL cleavage to form an inactive latent 
protein. Conformational change in serpins has 
been likened’ to a molecular mousetrap that 
is sprung by a target protease and is accampa- 
nied by a large increase in protein stability. 
In the mid-1980s, a sin gle-nucleotide 
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mutation in the gene encoding a,-antitrypsin 
(the Z-antitrypsin mutation) was discovered; 
the mutation results in the accumulation of 
highly stable, inactive polymers of this serpin 
in the endoplasmic reticulum of liver cells. 
Individuals with two copies of this mutated 
gene develop emphysema in early adulthood, 
and some develop liver cirrhosis in child- 
hood’. It was suggested that the mutation 
might subvert the conformational flexibility 
required for the inhibitory activity of serpins, 
and that polymerization occurred through the 
insertion of the RCL of one serpin molecule 
into the A-sheet of another, and so on. This 
‘A-sheet polymerization’ model® has remained 
an attractive explanation for the mechanism 
of serpinopathies. However, it fails to readily 
explain some essential details, such as why 
serpin polymers are highly stable, and why 
the mutations that do not affect the stability 
of the normally folded protein still polymer- 
ize during folding. To address these questions, 
solving the structure of the disease-linked 
serpin polymers has remained a challenging 
goal, which Yamasaki et al” now meet. 

The structure they present is of an anti- 
thrombin dimer, from which one can easily 
deduce how any serpin adopting a similar con- 
figuration could propagate into a long-chain 
polymer. This structure reveals a remarkable 
twist in the tale. Instead of simple single-B- 
strand linkages through RCL alone, serpins 
can undergo a structural change known as 
domain swapping, whereby a main structural 
feature — in this case, a third of the A-sheet 
— forms an equivalent region in another 
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NEWS & VIEWS 


OBITUARY 


Brian Pippard (1920-2008) 


Low-temperature physicist who excelled in subtle intuitive concepts. 


Brian Pippard, who died on 21 September, 
had many achievements, but will be 
remembered particularly as the first 
experimenter to map the Fermi surface 

of a metal, for his non-local theories 

of the response of normal metals and 
superconductors to electric fields, and as 
dynamic head of physics at the University of 
Cambridge from 1971 to 1982. 

Pippard grew up in Bristol, UK, and 
at school proved exceptional in music as 
well as science. He entered Clare College, 
Cambridge, in 1938, and, doubting his 
mathematical ability, chose first to study 
chemistry, before switching to physics to 
make himself useful during the Second 
World War. He worked at the Great Malvern 
research establishment on a radar device 
for tracking mortar projectiles, returning to 
Cambridge for his PhD in 1945. 

He was already familiar with the skin 
effect, the restriction of microwave fields 
to a thin surface layer in metals, and his 
supervisor David Shoenberg encouraged 
him to study it in superconductors — 
materials that mysteriously lose all electrical 
resistance below a sharply defined transition 
temperature. In 1935, Fritz and Heinz London 
had hinted that a superconductor might be 
a quantum fluid, a system of many electrons 
governed by a wavefunction similar to that 
of a single electron, as though they were all 
in the same quantum state. In such a system, 
the inertia of the superelectrons should lead 
to a purely inductive conductivity at high 
frequencies. By measuring the frequency and 
bandwidth of microwave resonators made 
of tin and mercury, Pippard successfully 
observed this effect when he cooled the 
resonators through their superconducting 
phase transitions. 

Pippard’s experiments were models of 
simplicity. For instance, the frequency of his 
klystron microwave source had to be stable 
to one part in a million, yet he achieved this 
using no more than draught protection, a 
micrometer tuner with a weight and pulley 
to eliminate backlash, and a stack of carefully 
aged high-voltage batteries as power supply. 

In 1940, Heinz London had noticed that 
the microwave surface resistance of normal 
metals, instead of falling steadily with falling 
temperature as the resistivity did, reached a 
limit. Pippard confirmed this ‘anomalous skin 
effect’ in his own samples and realized that it 
occurred when the electron free path became 
larger than the skin depth, and that only 
those electrons moving parallel to the sample 
surface remained effective in responding to 
the electric field. This idea led quickly to a 


l 


complete theory of the effect, published in 
1948 by Harry Reuter and Ernst Sondheimer, 
and Pippard’s simple ineffectiveness 

concept was later applied in understanding 
many other phenomena, such as cyclotron 
resonance and ultrasonic attenuation in 
metals. 

After his doctorate, Pippard became 
interested in the stability of normal metal- 
superconductor (NS) boundaries, and 
introduced the idea that their energy was 
determined by a characteristic coherence 
length. This length was later identified with 
that of the celebrated Ginzburg-Landau 
theory, and proved crucial in understanding 
the physics of type II superconductors, used 
in making superconducting magnets. 

Pippard’s work on the anomalous skin 
effect encouraged him to wonder whether, 
just as in normal metals the current at a 
given point depends on the electric field 
at neighbouring points through which 
the electrons had travelled since last being 
scattered, the superconducting current 
might likewise depend on the electric field at 
distant points — the range being limited by 
an electromagnetic coherence length. In 1953 
he proposed a model of this non-local effect. 
It proved entirely successful; and in 1957, 
when John Bardeen, Leon Cooper and Robert 
Schrieffer published their revolutionary 
theory of superconductivity, they were at 
pains to demonstrate that it confirmed 
Pippard’s model. 

In 1954, Pippard pointed out that in the 
anomalous skin effect, the surface impedance 
of a metal is related to its Fermi surface — the 
theoretical boundary in momentum-space 
within which the conduction electrons 
of a metal reside — and in fact provides a 
measure of the curvature of this surface near 
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the effective electrons. In 1955, during a 
sabbatical year at the University of Chicago, 
Pippard measured the surface resistances of 
single-crystal samples of copper in various 
orientations, and on his return to Cambridge 
used his data to map out the Fermi surface of 
copper, a widely recognized tour de force. He 
was elected to the Royal Society in 1956. 

In 1962, Pippard’s student Brian Josephson 
independently proposed a theory of quantum 
tunnelling between superconductors, which 
subsequently earned him a Nobel prize (in 
1973). Pippard was not directly involved, but 
later enthusiastically took up the study of the 
physics of SNS Josephson junctions, and of 
the peculiar phenomena that occur when 
the superconducting current is converted to 
normal current at an SN interface. During 
the same period, his book Dynamics of 
Conduction Electrons established him as 
an expert on all aspects of Fermi-surface 
phenomena, especially magnetoresistance, 
helicon waves, and magnetic breakdown in 
which electrons tunnel through momentum 
space from one part of the surface to another. 

In 1966, Pippard became the first president 
of Clare Hall, informal and happy home of 
graduates and distinguished visitors, and of 
his wife Charlotte and their three daughters. 
In 1971, he was eleded to the Cavendish 
chair in the department of physics, but even 
before that had been the moving spirit behind 
the much needed transfer of the Cavendish 
Laboratory to new buildings. 

As head of department from 1971, he 
encouraged innovative research and tried 
persistently — though not always successfully 
— to close down areas he judged to be less 
worthy. He was proactive in the reform of 
undergraduate teaching, and insisted on 
lecturing ability in staff appointments. He 
was himself always a stimulating teacher, 
and his perceptive Elements of Classical 
Thermodynamics and challenging Cavendish 
Problems in Classical Physics excited and 
tormented generations of students. He was 
president of the Institute of Physics during 
1974-76. 

Brian could be idiosyncratic. He was 
suspicious of mathematical formalism, and 
once, when invited to deliver a keynote speech 
to an international audience, selected as its 
title “The cat and the cream, and startled his 
listeners by announcing the imminent demise 
of solid-state physics. He was capable of great 
kindness, but also relished being boyishly 
clever: his inaugural lecture as Cavendish 
professor was planned around an intriguing 
series of bench experiments whose outcomes 
the assembled practitioners, young and old, 
were invited to predict, by show of hands. We 
duly got most of our predictions wrong, as he 
intended. 

John Waldram 

John Waldram was formerly at the Cavendish 
Laboratory, Cambridge CB3 OHE, UK. 

e-mail: jrw2@cam.ac.uk 
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BRIEF COMMUNICATIONS ARISING 


The fourth circuit element 


Arising from: D. B. Strukov, G. S. Snider, D. R. Stewart & R. S. Williams Nature 453, 80-83 (2008) 


In 1971, Chua suggested’ that there should in principle exist a circuit 
element linking electrical charge, q, and magnetic flux, g. Strukov et 
al. claim recently to have found such a link. However, here I point 
out that Chua’s suggestion was in fact preceded by experimental 
evidence*> for magneto-electric effects whereby magnetic and elec- 
trical signals are interconverted. When the stimulus is magnetic and 
the response is electrical, the linear magneto-electric coupling con- 
stant, o, is typically reported as dP/dH, where P represents the elec- 
trical polarization and H represents the applied magnetic field. The 
coupling constant could equally be presented as dq/dg if divided by 
the permeability of free space. 

In making their claim, Strukov et al. therefore overlook the mag- 
neto-electric literature. In magneto-electric systems, the link arises 
explicitly and through a simple geometry, whereas Strukov et al. 
consider a system in which magnetic signals are neither applied 
nor measured. Moreover, some magneto-electric systems display 
nonlinear coupling®’, which is necessary but not sufficient for mem- 
ristance’”. 

Neil D. Mathur’ 

'Department of Materials Science, New Museums Site, Pembroke Street, 
Cambridge CB2 3QZ, UK. 

e-mail: ndm12@cam.ac.uk 
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Early origins and evolution of microRNAs 
and Piwi-interacting RNAs in animals 


Andrew Grimson’, Mansi Srivastava’, Bryony Fahey”, Ben J. Woodcroft, H. Rosaria Chiang'”, Nicole King’, 
Bernard M. Degnan’, Daniel S. Rokhsar*” & David P. Bartel’? 


In bilaterian animals, such as humans, flies and worms, hundreds of microRNAs (miRNAs), some conserved throughout 
bilaterian evolution, collectively regulate a substantial fraction of the transcriptome. In addition to miRNAs, other bilaterian 
small RNAs, known as Piwi-interacting RNAs (piRNAs), protect the genome from transposons. Here we identify small RNAs 
from animal phyla that diverged before the emergence of the Bilateria. The cnidarian Nematostella vectensis (starlet sea 
anemone), a close relative to the Bilateria, possesses an extensive repertoire of miRNA genes, two classes of piRNAs and a 
complement of proteins specific to small-RNA biology comparable to that of humans. The poriferan Amphimedon 
queenslandica (sponge), one of the simplest animals and a distant relative of the Bilateria, also possesses miRNAs, both 
classes of piRNAs and a full complement of the small-RNA machinery. Animal miRNA evolution seems to have been 
relatively dynamic, with precursor sizes and mature miRNA sequences differing greatly between poriferans, cnidarians and 
bilaterians. Nonetheless, miRNAs and piRNAs have been available as classes of riboregulators to shape gene expression 


throughout the evolution and radiation of animal phyla. 


The RNA interference (RNAi) pathway, which processes long 
double-stranded RNA into small interfering RNAs and uses them 
to mediate gene silencing, is present in diverse eukaryotes, presum- 
ably with a role in transposon silencing or viral defence since early in 
eukaryotic evolution’. Building on this basal pathway, which 
includes the Dicer endonuclease and the argonaute (Ago) effector 
protein, some eukaryotic lineages have acquired additional path- 
ways, each using unique classes of small RNAs to guide silencing. 
MicroRNAs, ~21—24-nucleotide RNAs that derive from distinctive 
hairpin precursors, pair to messenger RNAs to direct their post- 
transcriptional repression’. More than one-third of human genes 
are under selective pressure to maintain pairing to miRNAs, implying 
that these riboregulators influence the expression of much of the 
transcriptome’. Piwi-interacting RNAs are longer, ~25-30 nucleo- 
tides, with incompletely characterized biogenic pathways. In mam- 
mals and flies, piRNA expression is restricted to the germ line, where 
they have crucial roles in transposon defence, although one class of 
mammalian piRNAs, highly expressed at the pachytene stage of 
sperm development, has unknown function*’. 

The plant and algal miRNAs have gene structure, biogenesis and 
targeting properties distinct from those of animals**. These differ- 
ences, considered together with the absence of miRNAs in fungi and 
all other intervening lineages examined, have led to the conclusion 
that miRNAs of animals and plants had independent origins®. Of the 
many miRNAs reported in Bilateria (Fig. 1), ~30 appear to have been 
present in ancestral bilaterians”'’; however, none have been reported 
in the earliest branching animal lineages, leading to the hypothesis 
that bilaterian complexity might, in part, be due to miRNA-mediated 
regulation''. Likewise, piRNAs have not been reported outside 
Bilateria, raising the question of whether a rich small-RNA biology 
is characteristic of more complex animals, or whether these small 
RNAs might have emerged earlier in metazoan evolution. 


Diverse microRNAs of the starlet sea anemone 


Eumetazoa includes the Bilateria as well as the Cnidaria. Among 
sequenced genomes Cnidaria is represented by the starlet sea anemone, 
Nematostella vectensis'*. To explore whether cnidarians have miRNAs, 
we sequenced complementary DNA libraries generated from 18—30- 
nucleotide RNAs isolated from Nematostella. High-throughput 
sequencing yielded 2.9 million reads perfectly matching the 
Nematostella genome (Fig. 2a). To identify miRNAs, we considered 
properties that have proved useful for distinguishing bilaterian 
miRNAs from other types of small RNAs represented in sequencing 
data’*'’. The first criterion was the presence of reads mapping to an 
inferred RNA hairpin with pairing characteristics of known miRNA 
hairpins. The second was the presence of reads from both arms of the 


Homo sapiens (human, 677 miRNAs) 

Mus musculus (mouse, 491 miRNAs) 

Drosophila melanogaster (fly, 147 miRNAs) 
Caenorhabditis elegans (nematode, 154 miRNAs) 
Schmidtea mediterranea (planarian, 61 miRNAs) 
Nematostella vectensis (sea anemone) 

Trichoplax adhaerens (placozoan) 

Amphimedon queenslandica (sponge) 

Monosiga brevicollis (choanoflagellate) 
Schizosaccharomyces pombe (yeast, 0 miRNAs) 
Saccharomyces cerevisiae (yeast, 0 miRNAs) 
Neurospora crassa (fungus, 0 miRNAs) 
Arabidopsis thaliana (flowering plant, 199 miRNAs) 
Physcomitrella patens (moss, 263 miRNAs) 
Chlamydomonas reinhardtii (green alga, 72 miRNAs) 
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Figure 1| Phylogenetic distribution of annotated miRNAs. Cladogram of 
selected eukaryotes, with organisms investigated in this study indicated in 
red. Branching order of Bilateria is according to ref. 28 and the references 
therein, and that of basal Metazoa is according to ref. 17 (Supplementary 
Discussion). Annotated miRNA tallies are from miRBase (v10.1)”’. 
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hairpin that, when paired to each other, formed a duplex with 
2-nucleotide 3’ overhangs. This duplex corresponds to an intermedi- 
ate of miRNA biogenesis in which the miRNA and opposing segment 
of the hairpin, called the miRNA*, are excised from the hairpin 
through successive action of Drosha and Dicer RNase III endonu- 
cleases*. The third criterion was homogeneity of the miRNA 5’ ter- 
minus. Because pairing to miRNA nucleotides 2-8 is crucial for target 
recognition’, reads matching bilaterian miRNAs display less length 
variability at their 5’ termini than at their 3’ termini'*””. 

As exemplified by mir-2024d (Fig. 2b, c), 40 distinct Nematostella 
loci met these criteria (Fig. 2d and Supplementary Data 1; identical 
hairpins were not counted because they might have arisen from 
genome-assembly artefacts). Additional features, not used as selec- 
tion criteria, resembled those of bilaterian miRNAs’, thereby increas- 
ing confidence in our annotations. For example, the loci usually 
mapped between annotated protein-coding genes (31 loci) or within 
introns in an orientation suitable for processing from the pre-mRNA 
(8 loci). The Nematostella miRNAs also had a tight length distri- 
bution (centring on 22 nucleotides, Fig. 2d), and five groups of 
miRNAs (corresponding to 13 miRNAs) mapped near to each other 
in an orientation suitable for production from the same primary 
transcript (Supplementary Data 1), as occurs in bilaterians*. With 
the exception of two miRNA pairs (miR-2024a,b and miR-2024f,d), 
the Nematostella miRNAs had unique sequences at nucleotides 2-8, 
suggesting notable diversity of miRNA targeting in this simple ani- 
mal. 

Previous studies that explored the possibility that cnidarians might 
have miRNAs searched for Nematostella homologues of the ~30 
miRNA families broadly conserved within the Bilateria by probing 
RNA blots and examining candidate hairpin sequences'"'*. These 
studies reported the possible presence of miR-10, miR-33 and miR- 
100 family members in Nematostella. None of our reads matched the 
proposed miR-10, miR-33 or miR-100 homologues, and none 
matched the proposed hairpin precursors of miR-10 or miR-33. 
Such discrepancies were not unexpected, because detection of dis- 
tantly related miRNAs by hybridization is prone to false-positives, 
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and many genomic sequences can fold into hairpins. However, one 
of the newly identified miRNAs arose from the hairpin of the reported 
miR-100 homologue. The actual miRNA was offset by one nucleotide 
compared to bilaterian miR-100 family members (Fig. 2e). Because 
miRNA-targeting is defined primarily by nucleotides 2-8, this offset is 
expected to alter target recognition substantially, with the 
Nematostella version primarily recognizing mRNAs containing 
CUACGGG and UACGGGA heptanucleotide sites and the bilaterian 
versions recognizing mRNAs with two different sites, VACGGGU and 
ACGGGUA’. 

Despite this wholesale shift in their predicted targeting, the 
Nematostella and bilaterian versions of miR-100 had similarity 
throughout the RNA, suggesting common origins (Fig. 2e). This 
result confidently extended the inferred origin of metazoan 
miRNAs back to at least the last common ancestor of these eume- 
tazoans. Systematic comparison to annotated miRNAs did not reveal 
any additional Nematostella miRNAs with similarity exceeding that 
of shuffled control sequences (Supplementary Fig. 1). Although the 
short length of miRNAs may cause sequence divergence to obscure 
common ancestry, it is noteworthy that only one of the 40 
Nematostella miRNAs appeared homologous to extant bilaterian 
miRNAs, and even this one seemed to have profoundly different 
targeting properties. 


MicroRNAs near the base of the metazoan tree 


To determine whether miRNAs might be present in more deeply 
branching lineages, we generated 2.5 million genome-matching reads 
from the small RNAs of the demosponge A. queenslandica, a pori- 
feran thought to represent the earliest diverging extant animal lin- 
eage'®'” (Figs 1 and 3a). Eight miRNA genes were identified in 
Amphimedon adult and embryo samples (Fig. 3b and Supple- 
mentary Data 2), exemplified by mir-2018 (Fig. 3c). Six mapped 
between annotated protein-coding genes; two fell within introns. 
As is typical for bilaterian miRNAs* and is also found in 
Nematostella (Fig. 2d), reads from one arm of the hairpin usually 
greatly exceeded those from the other arm, enabling unambiguous 
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annotation of the miRNA and miRNA* (Fig. 3b). However, the 
number of reads from the two arms of the mir-2015 hairpin did 
not differ substantially, suggesting that each might have similar pro- 
pensities to enter the silencing complex and target miRNAs. 
Moreover, the species from the 3’ arm (miR-2015-3p) dominated 
in adult tissue, whereas the one from the 5’ arm (miR-2015-5p) 
dominated in embryonic tissue (Fig. 3d), supporting the notion that 
this single hairpin produces two distinct miRNAs, and implying an 
intriguing, developmentally controlled differential loading into the 
silencing complex. 

In Amphimedon, pre-miRNA hairpins were larger than most of 
those of other metazoans (Fig. 3e). The Nematostella pre-miRNAs 
(including mir-100) fell at the other end of the spectrum, with a 
median length less than that of bilaterian pre-miRNAs (Fig. 3e). 
None of the Amphimedon miRNAs shared significant similarity with 
any previously described miRNAs (Supplementary Fig. 1), or with 
the miRNAs found in Nematostella. This observation, combined with 
their unusually large pre-miRNA hairpins, raised the possibility of an 
origin independent from that of eumetazoan miRNAs. Arguing 
against this possibility, we found Amphimedon homologues of 
Drosha and Pasha proteins (Table 1), which recognize the miRNA 
primary transcript and cleave it to liberate the pre-miRNA hairpin". 
Homologues of these proteins appeared to be absent in all lineages 
outside the Metazoa, indicating a single origin for these processing 
factors early in metazoan evolution and implying a single origin for 
their miRNA substrates. 

A third animal lineage branching basal to the Bilateria is Placozoa, 
represented by the sequenced species Trichoplax adhaerens'’. 
Although earlier analyses of mitochondrial genes suggested that 
Trichoplax diverged before Amphimedon, genomic data indicate that 
Trichoplax had a common ancestor with cnidarians and bilaterians 
more recently than with Amphimedon”’ (Fig. 1 and Supplementary 
Discussion). Our study of Trichoplax small RNAs failed to find 
miRNAs, despite acquiring many more reads than required to 
identify miRNAs in all other animals and plants examined 
(Supplementary Figs 2 and 3). Thus, despite the formal possibility 
that Trichoplax miRNAs are expressed at levels so low that we failed to 
detect them, we favour the hypothesis that all miRNA genes have 
been lost in this lineage. Trichoplax is thought to have derived from a 
more complex ancestor, having lost, for example, the hedgehog and 
Notch signalling pathways'’. Supporting our hypothesis, no Pasha 
homologue was found in the Trichoplax genome, although we did 
find the core RNAi proteins—argonaute and Dicer—suggesting the 
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production and use of small interfering RNAs (Table 1). Drosha, 
which partners with Pasha during miRNA biogenesis'*, was found 
also but might be required in the absence of miRNAs for ribosomal 
RNA maturation”’. Of the proteins involved in canonical miRNA 
biogenesis, Pasha is the one without known functions outside the 
miRNA pathway, and it was the one that appeared to have been 
discarded, together with all miRNAs, from the Trichoplax genome 
(Table 1). 

We also sequenced small RNAs from the single-celled organism 
Monosiga brevicollis (Supplementary Fig. 2), which represents the 
closest known outgroup to the Metazoa”’. We failed to detect any 
plausible miRNAs, a result consistent with our subsequent finding 
that Monosiga seems to lack all genes specific to small-RNA biology 
(Table 1). The absence of Dicer and argonaute seemed to be derived 
rather than ancestral, as the common ancestor of Monosiga and 
metazoans possessed these core RNAi proteins' (Table 1). The pos- 
sibility that the absence of miRNAs in Monosiga might likewise be 
derived prevented us from setting an early bound on the origin of 
metazoan miRNAs. 

In summary, miRNAs appear to have been available to shape gene 
expression since at least very early in animal evolution. Nonetheless, 
the numbers identified in simpler animals (8 unique miRNAs in 
Amphimedon and 40 in Nematostella) were lower than those reported 
in more complex animals (Fig. 1). Although miRNAs expressed only 
under specific conditions or at restricted developmental stages were 
possibly missed in these and other animals, our results are consistent 
with the idea that increased organismal complexity in Metazoa cor- 
relates with the number of miRNAs and presumably with the number 
of miRNA-mediated regulatory interactions. 


Piwi-interacting RNAs in deeply branching animals 

We next turned to the possibility that piRNAs also might have early 
origins. Piwi proteins, the effectors of bilaterian piRNA pathways, are 
found in diverse eukaryotic lineages (although not in plants or fungi, 
Table 1), implying their presence in early eukaryotes’. In cases char- 
acterized, however, the small RNAs associated with non-metazoan 
Piwi proteins resemble siRNAs more than bilaterian piRNAs (deriv- 
ing, for example, from Dicer-catalysed cleavage of long double- 
stranded RNA”'), raising the question of when piRNAs of the types 
found in Bilateria might have emerged. The genomes of both 
Amphimedon and Nematostella, but not that of Trichoplax, encode 
Piwi proteins (Table 1) and express many ~27-nucleotide RNAs with 
a 5'-terminal uridine (5’-U) (Figs 2a and 3a)—features reminiscent of 


a b Figure 3 | The miRNAs of Amphimedon 
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sequencing frequency from adult and embryo 
samples. e, Cumulative distributions of pre- 
miRNA lengths from miRNA transcripts of the 
species indicated. Amphimedon pre-miRNAs 
were significantly larger than those from any 
other animal species examined (P< 10 °, 
Wilcoxon rank-sum test), whereas those from 
Nematostella were significantly smaller 

(P<10 *). 
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Table 1| The small-RNA machinery of representative eukaryotes 


Species Ago Piwi Dicer Drosha Pasha  Henl 
Homo sapiens 4 4 1 1 1 1 
Drosophila melanogaster 2 3 2 1 1 i 
Caenorhabditis elegans* 5 3 1 i 1 1 
Nematostella vectensist 3 3 2 1 1 1 
Trichoplax adhaerens+ 1 ot 5 1 08 OF 
Amphimedon queenslandicat 2 3 4 i 1 2 
Monosiga brevicollis Of Ot Ot 0) 0 ot 
Saccharomyces cerevisiae ot Ot Ot 0) 0 ot 
Schizosaccharomyces pombe || i Ot 1 0 0) ot 
Arabidopsis thaliana 10 ot 4 0) 0 2 
Physcomitrella patens 6 Ot 5 0 0 1 
Chlamydomonas reinhardtii 2 Ot 3 0 0 sl 


* Omitted is a nematode-specific clade of proteins related to the Ago and Piwi protein families 
but distinct from both’. 

+ Protein sequences are listed in Supplementary Data 3. 

t Inferred loss based on presence in earlier-diverging lineages. 

§ Inferred loss based on presence in earlier-diverging lineages when assuming that Amphimedon 
diverged before Trichoplax (Supplementary Discussion). 

|| Ago and Dicer, but not Piwi, Drosha, Pasha or Hen!1, were also identified in each of the 
additional fungal species examined (Aspergillus nidulans, Neurospora crassa and Sclerotinia 
sclerotiorum). 


piRNAs in vertebrates and flies’. Moreover, 45% of Nematostella 5'-U 
27-30-nucleotide RNAs originated from only 89 genomic loci 
(together comprising 0.4% of the genome), the largest of which was 
62 kilobases, and essentially all of these small RNAs derived from one 
strand of each locus (Fig. 4a and Supplementary Table 3). In these 
respects the genomic loci producing a large fraction of the 
Nematostella reads closely resembled the loci producing bilaterian 
piRNAs, particularly the pachytene piRNAs’. We observed a similar 
clustering of genomic matches of Amphimedon 5'-U 24—30-nucleotide 
RNAs, although the loci were smaller and accounted for fewer reads 
(10% of the reads originating from 73 loci comprising 0.2% of the 
genome, Supplementary Table 4). 

Another characteristic of piRNAs is that they undergo Hen1- 
mediated methylation of their terminal 2’ oxygen”’. To test for this 
modification, we treated RNA from Nematostella and Amphimedon 
with periodate and then re-sequenced from both treated and untreated 
samples (Supplementary Fig. 4). Piwi-interacting RNAs and other 
RNAs modified at their 2’ oxygen remain unchanged with this treat- 
ment and are sequenced, whereas those with an unmodified 2’,3’ cis- 
diol are oxidized, which renders them refractory to sequencing”. In 
contrast to the Amphimedon miRNAs and many of the Nematostella 
miRNAs (Supplementary Tables 1 and 2), reads corresponding to the 
candidate piRNA clusters in both Nematostella and Amphimedon were 
not reduced after treatment (Supplementary Tables 3 and 4), indi- 
cating that their terminal 2’,3’ cis-diol was modified. This modifica- 
tion, considered together with their other features characteristic of 
vertebrate and fly piRNAs, including the length of 25-30 nucleotides, 
the 5’-U bias, and the single-stranded, clustered organization of their 
genomic matches, provided evidence that these small RNAs repre- 
sented piRNAs of Nematostella and Amphimedon. 

The piRNAs were the type of small RNAs most abundantly 
sequenced in Nematostella and Amphimedon (Figs 2a and 3a, and 
Supplementary Discussion). A similar phenomenon is observed in 
mammalian testes, in which the pachytene piRNAs greatly outnumber 
the miRNAs and initially obscured detection of a second class of 
mammalian piRNAs, which resemble the most abundant Drosophila 
piRNAs with respect to both their biogenesis and their apparent role in 
suppressing transposon activity*. Most of the Nematostella and 
Amphimedon genomic loci with clustered piRNA matches resembled 
the first class of piRNAs, in that they tended to fall outside of anno- 
tated genes (P< 10 *, Wilcoxon rank-sum test) and spawned piRNAs 
predominately from only one DNA strand (>99% and 96% from one 
strand, Nematostella and Amphimedon, respectively). To determine 
whether the second class of piRNAs might also exist in deeply branch- 
ing lineages, we analysed the sequences from periodate-treated sam- 
ples, focusing on the minority that matched annotated protein-coding 
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genes (Fig. 4b). As expected for class II piRNAs, these piRNAs did not 
have such a strong tendency to match only one strand of the DNA 
(62% and 64% antisense for Nematostella and Amphimedon, respect- 
ively). Moreover, among the predicted coding regions with the most 
matches to the piRNAs, a significant fraction (18 of 50 in Nematostella, 
P<10 *;12 of 40in Amphimedon, P = 0.03, Supplementary Tables 5 
and 6) were homologous to transposases. 

Having found small RNAs resembling bilaterian class II piRNAs 
we looked for evidence that they were generated through the same 
feed-forward biogenic pathway*”’. In this pathway, primary piRNAs 
from transcripts antisense to transposable elements pair to transpo- 
son messages and direct their cleavage. This cleavage defines the 5’ 
termini of secondary piRNAs generated from the transposon mess- 
age, and these secondary piRNAs pair to piRNA transcripts, directing 
cleavage and thereby defining the 5’ termini of additional piRNAs 
resembling the primary piRNAs. Because the primary piRNAs typ- 
ically begin with a 5’-U and direct cleavage at the nucleotide that 
pairs to position 10, the secondary piRNAs typically have an A at 
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Figure 4 | The piRNAs of basal metazoans. a, Distribution of reads 
matching a Nematostella piRNA locus. Plotted is the number of matching 
reads with 5’ nucleotide falling within each 100-nucleotide window (main 
graph) or at each nucleotide (higher-resolution inset) spanning the genomic 
region. Bars above and below the x axis indicate matches to the indicated 
strand, with black bars indicating reads with a 5’-U and red bars indicating 
the sum of all other reads. For reads also matching other genomic loci, 
counts were normalized by total genome matches. Other annotated piRNA 
loci are presented in Supplementary Tables 3 and 4. kb, kilobases. b, An 
annotated pre-mRNA corresponding to numerous small RNAs resistant to 
periodate treatment. Annotated coding segments (open boxes) and intron 
segments (black line) are indicated. The gene was homologous to 
endonuclease/reverse transcriptases of other genomes and presumed to be a 
transposase. Small RNAs with unique 5’ ends are represented by coloured 
bars above or below the transcript (sense and antisense, respectively), with 
colours indicating the read numbers (normalized to account for the number 
of transcriptome matches). Small RNAs matching splice junctions (observed 
only for sense reads) are represented by discontinuous bars, linked by 
dashed lines. Other Nematostella and Amphimedon coding regions matching 
candidate piRNAs are listed in Supplementary Tables 5 and 6. ¢, Nucleotide 
composition of periodate-resistant small RNAs matching the indicated 
strand of Nematostella or Amphimedon annotated coding regions. 
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position 10. Examination of all 27-30-nucleotide periodate-resistant 
reads antisense to Nematostella coding regions revealed a propensity 
for a 5'-U, characteristic of primary piRNAs (Fig. 4c). The sense- 
strand piRNAs lacked this 5'-U bias and instead displayed a propen- 
sity for an A at position 10 (Fig. 4c and Supplementary Fig. 5). 
Moreover, sense and antisense reads that paired to each other tended 
to have 10 base pairs formed between their 5’ ends (Supplementary 
Fig. 6). For the 24~—30-nucleotide periodate-resistant reads from 
Amphimedon, the same hallmark features of the back-and-forth, or 
ping-pong, amplification cycle for piRNA biogenesis**> were 
observed (Fig. 4c and Supplementary Fig. 6). We conclude that the 
two classes of piRNAs found previously in mammals and flies have 
existed since the origin of metazoans: the class I piRNAs, represented 
by the mammalian pachytene piRNAs, which have unknown func- 
tion during germline development, and the class II piRNAs, which 
use the ping-pong cleavage and amplification cascade to quiet 
expression of certain genes, particularly those of transposons. 
Indeed, the sequence-based transposon silencing by piRNAs, which 
by virtue of the feed-forward amplification process focuses on the 
most active transposon species, might be one of the principle drivers 
of transposon diversity in animals. 

Taken together, our results indicate that miRNAs and piRNAs, as 
classes of small riboregulators, have been present since the dawn of 
animal life, and indeed might have helped to usher in the era of 
multicellular animal life. However, metazoan miRNA evolution 
seems to have been very dynamic: all miRNAs have been lost in 
Trichoplax, and the pre-miRNAs of Porifera, Cnidaria and Bilateria 
have assumed distinct sizes. In addition, no miRNAs have recogniz- 
able conservation between poriferans, cnidarians and bilaterians, 
with only one of the Nematostella miRNAs displaying recognizable 
homology to bilaterian miRNAs, either because it is the only homo- 
logue of extant bilaterian miRNAs or because divergence has 
obscured common ancestry of other miRNAs. The wholesale shifts 
in miRNA function implied by this plasticity are congruent with the 
report that, although thousands of miRNA-target interactions have 
been maintained within each of the nematode, fly and vertebrate 
lineages, very few appear to be conserved throughout all three 
lineages*®. The plasticity of miRNA sequences over long timescales 
helps to explain why the rich small-RNA biology in basal organisms 
had escaped detection for so long. 


METHODS SUMMARY 


The M. brevicollis library was constructed as described'* and sequenced by 454 
Life Sciences. All other libraries (Supplementary Table 7) were constructed using 
an analogous method and sequenced on the Illumina platform. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Small RNA sequencing. Samples of N. vectensis (mixed developmental stages, 
including adult), A. queenslandica (adult tissue, stored in RNAlater, Ambion) 
and M. brevicollis were ground under liquid nitrogen, and then RNA was 
extracted with Trizol (Invitrogen). RNA from T. adhaerens (mixed devel- 
opmental stages, including adult) and A. queenslandica (mixed embryos, from 
cleavage stage to the larval stage*’, stored in RNAlater) was extracted directly 
with Trizol. The M. brevicollis library was constructed as described'’* and 
sequenced by 454 Life Sciences. All other libraries (Supplementary Table 7) were 
sequenced on the Illumina platform, and prepared as follows. The 18—30-nuc- 
leotide RNAs were purified from total RNA (typically 5 ug) using denaturing 
polyacrylamide-urea gels. Before purification, trace amounts of 5'-*’P-labelled 
RNA size markers (AGCGUGUAGGGAUCCAAA and GGCAUUAACGCGG- 
CCGCUCUACAAUAGUGA) were mixed with the total RNA and used to mon- 
itor this purification and subsequent ligations and purifications. The gel-purified 
RNA was ligated to pre-adenylated adaptor DNA (AppTCGTATGCCGTC- 
TTCTGCTTG-[3’-3’ linkage]-T) using T4 RNA ligase (10 units ligase, GE 
Healthcare, 10 pl reaction, 50 pmol adaptor ATP-free ligase buffer*', for 2h at 
21-23 °C). Gel-purified ligation products were ligated to a 5’-adaptor RNA 
(GUUCAGAGUUCUACAGUCCGACGAUC), again using T4 RNA ligase (as 
above, except with 20 units ligase, 15,1 reaction supplemented with 4 nmol 
ATP, 400 pmol adaptor, for 18h at room temperature). Gel-purified ligation 
products were reverse-transcribed (SuperScript II, Invitrogen, 30 ul reaction 
with the reverse transcription primer CAAGCAGAAGACGGCATA) and then 
RNA was base-hydrolysed with addition of 5 il of 1 M NaOH and incubation at 
90°C for 10 min, followed by neutralization with addition of 25 pl 1 M HEPES, 
pH7.0, and desalting (Microspin G-25 column, Amersham). The resulting 
cDNA library was amplified with the RT primer and PCR primer 
(AATGATACGGCGACCACCGACAGGTTCAGAGTTCTACAGTCCGA) for a 
sufficient number of cycles (typically ~20) to detect (SYBR Gold, Invitrogen) a 
clear band in a 90% formamide, 8% acrylamide gel, used for purification. Gel- 
purified amplicon (85-105 nucleotides) from each library was subjected to 
Illumina sequencing. The adaptor and primer sequences enabled cluster genera- 
tion on the Illumina machine and placed a binding site for the sequencing primer 
(CGACAGGTTCAGAGTTCTACAGTCCGACGATC) adjacent to the sequence 
of the small RNA. Periodate-treated libraries were generated identically, except 
total RNA was first subjected to B-elimination*’. Mock-treated libraries omitting 
periodate were constructed in parallel. 

MicroRNA identification and analysis. The N. vectensis, T. adhaerens and M. 
brevicollis genomes and predicted gene sets'*'”*° were downloaded from JGI 
(http://jgi.doe.gov); the A. queenslandica genome was a preliminary assembly’®. 
After removing the adaptor sequences, reads were collapsed to a non-redundant 
set and matched to the appropriate genome. Genome matches were clustered if 
neighbouring matches fell within either 50 nucleotides (Amphimedon, 
Nematostella) or 500 nucleotides (Amphimedon) of each other. The increased 
size of the clustering window used for the Amphimedon analysis (500 nucleo- 
tides) was necessary because the 50-nucleotide window was insufficient to 
identify all Amphimedon miRNAs, owing to the increased size of their pre- 
miRNAs (Fig. 3e). No additional miRNAs were identified in Nematostella when 
using a 500-nucleotide window. Sequences of clusters containing 17—25-nucleo- 
tide reads cloned at least twice were folded with RNAfold*’. If the most frequently 
sequenced species was located on one arm of a predicted hairpin and the region 
of the hairpin corresponding to that sequence contained =16 base pairs, the 
candidate locus was examined manually for characteristics of known miRNAs, 
using criteria described in the main text. Before comparing between adult and 
embryonic libraries (Fig. 3d), counts corresponding to each mature miRNA 
from each library were first normalized by the total number of genome-matching 
reads in that library. 

To detect possible homology between previously known miRNAs and either 
Nematostella or Amphimedon miRNAs, we searched miRBase (version 10.1) for 
miRNAs similar to our new miRNAs. Because miRNA conservation is most 
pronounced within the miRNA 5’ region”, we first identified any known and 
new miRNAs that shared a hexanucleotide within their first eight nucleotides, 
allowing two-nucleotide offsets. Because of the limited length of the search 
sequence, and the large number of miRNAs in miRBase, most Nematostella or 
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Amphimedon miRNAs shared a hexanucleotide with miRBase miRNAs. For all 
such cases, we then searched for extended similarity between the pairs of 
miRNAs. With the exception of the miR-100 relationship, no more than chance 
similarity was observed (Supplementary Fig. 1). However, we cannot rule out the 
possibility that additional homologous relationships are present but undetect- 
able. Because miRNAs are shorter than most other genetically encoded mole- 
cules, sequence divergence can more easily obscure homologous relationships, 
and although they resist changes in the seed region, which is crucial for target 
recognition, divergence in this 5’ region can be accelerated with the processes of 
sub- and neo-functionalization’’. 

Piwi-interacting RNA identification and analysis. Nematostella 27—30-nucleo- 
tide RNAs and Amphimedon 24-30-nucleotide RNAs were mapped to their 
respective genome, and at each matching locus counts were normalized, dividing 
by the number of genome matches for the sequenced RNA. Regions with both a 
high number of match-normalized reads (Nematostella: > 1,000 per 10 kilobases; 
Amphimedon: >100 per 5 kilobases) and a high diversity of read sequences 
(Nematostella): >500 different sequences per 10 kilobases; Amphimedon: >50 
different sequences per 5 kilobases) were identified; following the periodate 
experiment we further evaluated these regions, which led to the removal of four 
Amphimedon regions that had far fewer reads in the periodate-treated libraries. 
The remaining regions are listed in Supplementary Tables 3 (Nematostella) and 4 
(Amphimedon), which report the proportion of 5'-U match-normalized reads to 
each strand and the ratio of match-normalized read counts in periodate-treated 
compared to mock-treated libraries, after normalization for the number of gen- 
ome-matching reads in each library. The number of predicted transcripts'*'® 
overlapping genomic piRNA clusters (Supplementary Tables 3 and 4) was cal- 
culated and compared to the number overlapping 1,000 random sets equal in size 
and number to the piRNA clusters. Inferred protein sequences from predicted 
transcripts matching the greatest number of periodate-resistant, match-normal- 
ized reads were compared to annotated protein sequences using BLAST. 
Transcripts that were significantly similar to annotated transposons, or protein 
domains implicated as transposases (for example reverse transcriptases) were 
considered to encode transposases. A random selection of 100 predicted tran- 
scripts was searched similarly to ascertain significance (Nematostella: 3 out of 
100; Amphimedon: 6 out of 100). When mapping to annotated protein-coding 
regions (Fig. 4b), reads with both sense and antisense matches were distributed 
to both the sense and antisense tallies after weighting by the proportion of their 
sense and antisense matches. 

Cataloguing of the small RNA machinery. To identify homologues of compo- 
nents of the small RNA machinery, all established family members from H. 
sapiens, D. melanogaster, C. elegans, S. pombe and A. thaliana were used as 
BLAST query sequences against all annotated protein sequences of each species 
in Table 1. The top-ranking hits resulting from these initial searches were used 
reciprocally as query sequences against all annotated protein sequences of H. 
sapiens, D. melanogaster, C. elegans, S. pombe and A. thaliana. If the top-ranking 
hits of such reciprocal queries corresponded to an established family member, 
the query sequence was considered to be a candidate homologue. The domain 
structure of each candidate sequence was then evaluated*, and candidates lack- 
ing the diagnostic domains were discarded. The diagnostic domains used were a 
Paz and a Piwi domain (for Ago and Piwi family members), two RNase III 
domains (Dicer and Drosha), a double-stranded RNA-binding domain 
(Pasha) and a methylase domain (Hen1). 
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Activity-dependent regulation of 
inhibitory synapse development by Npas4 


Yingxi Lin’, Brenda L. Bloodgood’, Jessica L. Hauser't, Ariya D. Lapan’, Alex C. Koon't, Tae-Kyung Kim’, 


Linda S. Hu’, Athar N. Malik’ & Michael E. Greenberg’ 


Neuronal activity regulates the development and maturation of excitatory and inhibitory synapses in the mammalian brain. 
Several recent studies have identified signalling networks within neurons that control excitatory synapse development. 
However, less is known about the molecular mechanisms that regulate the activity-dependent development of GABA 
(y-aminobutyric acid)-releasing inhibitory synapses. Here we report the identification of a transcription factor, Npas4, that 
plays a role in the development of inhibitory synapses by regulating the expression of activity-dependent genes, which in turn 
control the number of GABA-releasing synapses that form on excitatory neurons. These findings demonstrate that the 
activity-dependent gene program regulates inhibitory synapse development, and suggest a new role for this program in 
controlling the homeostatic balance between synaptic excitation and inhibition. 


Sensory experience controls multiple steps in the development and 
maturation of synapses in the mammalian brain’*. Many of the 
effects of neuronal activity are mediated by the release of glutamate 
at excitatory synapses and the subsequent influx of calcium (Ca**) 
into the postsynaptic neuron. This results in changes in the number 
and strength of synapses, a process that underlies learning and mem- 
ory as well as animal behaviour. 

Neurons in the central nervous system receive excitatory synaptic 
input from glutamatergic neurons and inhibitory input from GABA- 
releasing (GABAergic) interneurons, except during early develop- 
ment when the first GABAergic synapses are depolarizing and 
provide the excitatory drive critical for the subsequent development 
of glutamatergic synapses. The proper balance between excitatory 
and inhibitory synapses is crucial for representation of sensory 
information®’, execution of motor commands*” and higher-order 
cognitive functions'®. Neurological disorders such as autism, schizo- 
phrenia and epilepsy are associated with an imbalance between excit- 
atory and inhibitory synapses'’'*. The number or strength of 
excitatory synapses can be modified in response to changes in acti- 
vity, and the molecular mechanisms of these processes have been 
extensively investigated'*"*. Less is known about the activity- 
dependent regulation of inhibitory synapses. 

The density of inhibitory synapses in brain regions such as primary 
sensory cortex, hippocampus and cerebellum is regulated by the level 
of excitatory synaptic activity and sensory input'”~*. In addition, 
initiation of the critical period for synaptic plasticity in the visual 
cortex is dependent on visual activity and strongly influenced by the 
maturation of inhibitory synapses”, suggesting that the activity- 
dependent regulation of GABAergic synapses is important for the 
plasticity of the nervous system. Finally, recent studies indicate that 
regulation of GABAergic synapses in response to neuronal activity 
may be a critical component of the homeostatic mechanism that 
maintains a balance between excitation and inhibition in the face 
of fluctuations in the level of sensory input into neural circuits”. 
Despite the accumulating evidence that neuronal activity regulates 


the development and maintenance of inhibitory synapses, the 
molecular mechanisms that control these processes remain to be 
characterized. 

Here we identify a transcription factor, Npas4, that is critical for 
activity-dependent regulation of GABAergic synapse development. 
Npas4 expression is rapidly activated by excitatory synaptic activity 
and turns on a program of gene expression that triggers the forma- 
tion and/or maintenance of inhibitory synapses on excitatory neu- 
rons. These findings provide a molecular link between neuronal 
excitation and GABAergic synapse development, and suggest a new 
role for the activity-dependent gene program in controlling inhib- 
itory synapse formation/maintenance on excitatory neurons. 


Npas4 is regulated by neuronal activity 


The formation of inhibitory synapses onto excitatory neurons is 
regulated by neuronal activity, takes place over several days, and is 
a cell-wide process that results in the formation of synapses onto both 
the cell body and dendrites'*”*. These features led us to hypothesize 
that activity-dependent development of inhibitory synapses might be 
controlled postsynaptically by one or more activity-regulated genes. 
To test this hypothesis, we used DNA microarrays to identify genes 
that are induced by membrane depolarization in mouse cortical 
neurons at the time when inhibitory synapses are developing. 

We identified more than 300 genes whose expression levels were 
altered upon membrane depolarization (Gene Expression Omnibus 
accession number GSE11256), a third of which were novel activity- 
regulated genes not seen in previous screens**”’. We looked for genes 
predicted to encode transcription factors, reasoning that, through 
genome-wide characterization of the targets of an activity-regulated 
transcription factor that controls inhibitory synapse number, we 
could gain insight into the biological program that is important for 
inhibitory synapse development. Among the approximately 20 
known or putative transcription factors identified, we focused on 
genes that are selectively induced by Ca’* influx in neurons but 
not other cell types, that are transcribed in response to excitatory 
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synaptic activity, and that are expressed coincidently with the 
development of inhibitory synapses. One transcription factor, the 
bHLH-PAS family member Npas4 (refs 28-31), fulfilled all these 
criteria (Fig. 1) and was investigated further. 

Unlike other activity-dependent transcription factors such as CREB 
and c-fos, Npas4 expression in neurons is selectively induced by Ca** 
influx but not by several neurotrophic factors, growth factors or for- 
skolin, an activator of protein kinase A (Fig. 1b). Furthermore, Npas4 
induction is transient (Fig. 1c), occurs selectively in neurons, and 
predominately in excitatory neurons (data not shown). Npas4 
expression is barely detectable in immature primary hippocampal 
neurons, when there are few synapses”, and increases with the forma- 
tion and maturation of synapses” (Fig. 1d). 
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Figure 1| Npas4 expression is regulated by neuronal activity in vitro and in 
vivo. a, Immunostaining showing Npas4 protein is induced in rat 
hippocampal neurons (7 DIV) by depolarization (50 mM KCl, 2h, right). 
p-CREB, CREB phosphorylated at Ser 133. b, Western blot showing Npas4 is 
selectively induced by membrane depolarization (50 mM KCl, 7 DIV rat 
hippocampal neurons), but not by BDNF (50 ngml_'), NT3 (50ngml7'), 
NT4 (50 ngml ’), forskolin (10 1M), NGF (100 ngml '), EGF 
(100ngml '), PDGF (100ngml 1), CNTF (100 ng ml ') or IGF-1 

(100 ngml '). Induction is prevented by pretreatment with the Ca** 
chelator EGTA (5 mM, 10 min). ¢, Western blot showing Npas4 (7 DIV rat 
hippocampal neurons) is transiently induced by membrane depolarization 
(50 mM KCI, 30 min). d, Basal Npas4 expression increases as neurons 
mature, presumably because of increased endogenous spontaneous activity: 
compare immunostaining of 7 and 14 DIV rat hippocampal neurons. 

e, Western blot showing stimulation of primary hippocampal neurons 

(14 DIV) with bicuculline (50 11M, 2 h) increases Npas4 expression levels. 
This is prevented by pretreatment with nimodipine (5 UM, 1h) or EGTA 
(5 uM, 5 min) and reduced by pretreatment (1 h) with antagonists to NMDA 
receptors (100 uM AP5) or AMPA receptors (50 LM CNQX). f, g, Mice dark- 
reared for one week (P21—P28) and then stimulated with strobe lights have 
greater Npas4 expression levels in the visual cortex than their dark-reared 
littermates, but there is no difference in the hippocampus. Light stimulation 
was applied for 2h for immunocytochemistry analysis (f) or 1h for Npas4 
mRNA quantification (g). Significance was determined using a one-tailed 
paired t-test, *P < 0.05. Data are shown as mean + s.e.m. 
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Npasé is induced in cultured neurons by the GABA,-receptor ant- 
agonist bicuculline, which increases action-potential firing and excit- 
atory synaptic transmission (Fig. le). This induction requires an 
influx of extracellular Ca** through L-type voltage-sensitive calcium 
channels (L-VSCCs) and is partly dependent on the activation of 
N-methyl-p-aspartate (NMDA) and «-amino-3-hydroxy-5-methyl- 
4-isoxazole propionic acid (AMPA) receptors (Fig. le). Npas4 
expression is also induced in pertinent brain regions in vivo in res- 
ponse to specific stimuli: visual stimulation of mice after a period of 
dark-rearing results in an increase in Npas4 messenger RNA (mRNA) 
and protein levels specifically in the visual cortex (Fig. 1f, g). 


Npas4 regulates the development of inhibitory synapses 


The effect of Npas4 on inhibitory synapse development was investi- 
gated by RNA interference (RNAi)-mediated knockdown in excit- 
atory neurons of rat dissociated hippocampal cultures, before 
synapse formation was underway. A small hairpin RNA targeting 
Npas4 (Npas4-RNAi) effectively reduced Npas4 expression (Fig. 2a) 
without affecting the overall health of the neurons (Supplementary 
Fig. 1). To measure inhibitory synapse number, neurons were immu- 
nostained for presynaptic GABA-producing enzyme GAD65 and 
postsynaptic GABA,-receptor y2 subunit (GABA,-y2). Co-local- 
ization of GAD65 and GABA,-y2 puncta on a green fluorescent 
protein (GFP)-transfected glutamatergic neuron was considered 
indicative of a synapse (Fig. 2b). Expression of Npas4-RNAi, but 
not a scrambled RNAi (control-RNAi), significantly reduced the 
number of inhibitory synapses (Fig. 2c). Analysis of synapse number 
using antibodies that recognize a second pair of inhibitory synaptic 
proteins (GAD67 and GABA,-2/3) gave a similar result (data not 
shown). These data suggest that Npas4 positively regulates the num- 
ber of inhibitory synapses that form on excitatory neurons. 
Different classes of inhibitory neurons synapse onto distinct peri- 
somatic or dendritic regions of pyramidal neurons*’. We found that 
Npas4-RNAi leads to a reduction in the number of inhibitory 
synapses formed on both the perisomatic and dendritic regions, sug- 
gesting that Npas4 regulates the number of inhibitory synapses 
formed by multiple classes of inhibitory neurons (Fig. 2c). In both 
regions, Npas4-RNAi significantly reduced the density of post- 
synaptic GABA,-y2 puncta but had less effect on presynaptic 
GAD65 puncta, compared with the control-RNAi (Fig. 2d). These 
findings suggest that Npas4 regulates inhibitory synapse number by 
controlling the number of postsynaptic specializations, resulting in 
subsequent remodelling or retraction of the presynaptic terminals. 
To test whether Npas4 is important for the development of func- 
tional inhibitory synapses in a more intact neural circuit, rat organo- 
typic hippocampal cultures were biolistically co-transfected with GFP 
and either a control vector or Npas4-RNAi, and whole-cell recordings 
were performed on GFP-positive CAl pyramidal neurons to measure 
spontaneous miniature inhibitory postsynaptic currents (mIPSCs, 
Fig. 3a). Knocking down Npas4 expression in organotypic cultures 
significantly increased the inter-event interval and decreased the ampli- 
tude of mIPSCs (Fig. 3b, c). This effect of Npas4-RNAi in organotypic 
slices may be indicative of a decrease in inhibitory synapse number, 
consistent with our observation that Npas4-RNAi reduces inhibitory 
synapse number in dissociated cultures. Together, these findings indi- 
cate that Npas4 plays an important role in regulating the number of 
functional inhibitory synapses received by an excitatory neuron. 
Consistent with a role for Npas4 in inhibitory synapse development, 
Npas4 knockout mice (Npas4~/~, Supplementary Fig. 2) appear 
anxious and hyperactive, are prone to seizures and have a shortened 
lifespan compared with their wild-type littermates, phenotypically 
resembling other knockout mice lacking genes that control the forma- 
tion or function of inhibitory circuits***°. However, we found that the 
frequencies of mIPSCs in acute hippocampal slices prepared from 
wild-type and Npas4‘~ mice were similar (Supplementary Fig. 3), 
in contrast to the clear change in mIPSC frequency caused by disrupt- 
ing Npas4 expression acutely in organotypic slices. This difference may 
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Figure 2 | Npas4 regulates the number of GABAergic synapses in cultured 
hippocampal neurons. a, Npas4-RNAi, but not control-RNAi, reduces the 
expression of Npas4 in primary hippocampal neurons. Cultures were 
transfected at 6 DIV and stimulated with bicuculline (50 1M, 2h) at 14DIV. 
b, The number of GABAergic synapses is significantly reduced by Npas4- 
RNAi, as illustrated by two representative rat hippocampal neurons. Cultures 
were co-transfected (6 DIV) with GFP and either Npas4-RNAi (top) or 
control-RNAi (bottom). Cultures were subsequently immunostained 

(25 DIV) with antibodies against GAD65 (blue) and GABA,-y2 (red). 

c, Quantification of the normalized density of co-localized GABA,-y2 and 


be due to the activation of compensatory pathways during develop- 
ment in the absence of Npas4. Alternatively, it may reflect the fact that 
Npas4 is absent in both pre- and postsynaptic cells in acute slices from 
Npas4 ‘~ mice, whereas in the experiments with organotypic slices the 
disruption of Npas4 expression occurs in postsynaptic neurons only. 

To determine conclusively the involvement of Npas4 in inhibitory 
synapse development, we generated an Npas4 conditional knockout 
mouse (Npas4"“"*) in which the coding region of Npas4 is flanked by 
loxP sites and can be acutely removed by Cre-mediated recombina- 
tion (Supplementary Fig. 4). Organotypic hippocampal slices were 
prepared from Npas4**"* mice and whole-cell recordings made from 
CA1 pyramidal neurons transfected with GFP and either a control 
vector or a vector encoding Cre recombinase. Cre expression had no 
effect on mIPSCs in wild-type neurons (Supplementary Fig. 5). 
However, compared with transfection with the control construct, 
transfection of Npas4"’"* neurons with Cre led to a significant 
increase in the mIPSC inter-event interval (Fig. 3d, e). This finding 
provides further evidence that CA1 pyramidal neurons lacking Npas4 
receive fewer inhibitory synaptic inputs. 

We next investigated whether the number of inhibitory synapses 
forming onto a cell is controlled by the amount of Npas4 expressed in 
response to excitatory stimuli. If this is the case, increasing the level of 
Npas4 should lead to an increase in the number of inhibitory 
synapses formed on the Npas4-expressing neuron. Additional 
copies of the Npas4 gene were introduced into neurons using an 
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GAD65 puncta in 14 DIV rat hippocampal neurons transfected with vector 
control, Npas4-RNAi or control-RNAi. d, Separate quantification of 
perisomatic and dendritic GABA,-y2 and GAD65 puncta measured in 

c. e, Npas4-minigene increases the density of co-localized GABA,-y2 and 
GAD65 puncta. f, Separate quantification of perisomatic and dendritic 
GABAa-y2 and GAD65 puncta measured in e. See Methods for details of data 
normalization and error propagation. Significance was determined using 
multifactorial analysis of variance. *P < 0.05; **P < 0.005; ***P < 0.0005. 
Data are presented as mean = s.e.m. from three (¢, d) or four (e, f) independent 
experiments; total numbers of neurons analysed (1) are indicated. 


Npas4-minigene cassette consisting of all Npas4 introns and exons 
as well as 5 kilobases (kb) of genomic sequence 5’- and 3’- to the 
coding region. We verified that the Npas4-minigene drives ectopic 
expression of Npasé4 that is activity regulated and functions similarly 
to the endogenous gene (Supplementary Fig. 6). We found that sig- 
nificantly more inhibitory synapses were formed onto cultured hip- 
pocampal neurons expressing the Npas4-minigene than onto 
neurons transfected with a control vector (Fig. 2e), largely because 
of an increase in the number of postsynaptic GABA,-y2 puncta 
(Fig. 2f). Furthermore, expression of the Npas4-minigene in CA1 
pyramidal neurons of hippocampal slices significantly decreases the 
mIPSC inter-event interval and significantly increases the amplitude 
(Fig. 3b, c), consistent with the presence of additional and stronger 
inhibitory synapses on neurons expressing higher levels of Npas4. 


Effect of Npas4 on excitatory synapses 


The appropriate balance between excitation and inhibition is critical 
for the function of neural circuits. To maintain this balance, changes 
in the number or strength of inhibitory synapses are often coupled to 
changes in excitatory synapses”. We next asked whether excitatory 
synapse number or function is also affected when Npas4 expression is 
perturbed. 

We first determined whether Npas4 regulates the number of excit- 
atory synapses in dissociated hippocampal cultures. As before, neu- 
rons were transfected with either Npas4-RNAi or control constructs, 
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before synaptogenesis was underway and thus before the balance 
between excitation and inhibition was established. Cultures were 
immunostained for the presynaptic marker synapsin! and the excit- 
atory postsynaptic marker PSD95, and the numbers of co-localized 
synapsin1l and PSD95 puncta on transfected glutamatergic neurons 
were quantified. Neither the total numbers of excitatory synapses nor 
the individual numbers of pre- or postsynaptic markers changed 
significantly upon expression of Npas4-RNAi (Fig. 4a, b), under 
conditions that significantly decreased the number of inhibitory 
synapses. Likewise, expression of the Npas4-minigene had no effect 
on excitatory synapse number (Fig. 4c, d). Because perturbation of 
the level of Npas4 expression occurred before synaptic connections 
were established, these experiments suggest that Npas4 is not a major 
contributor to excitatory synaptogenesis. 

We next examined whether Npas4 affects excitatory synapse func- 
tion in organotypic hippocampal slices, where homeostatic mechan- 
isms are known to control excitatory/inhibitory balance within 
neural circuits. Deletion of the Npas4 gene from neurons in 
Npas4""* organotypic hippocampal slices by Cre recombination 
significantly decreased the inter-event interval of spontaneous mini- 
ature excitatory postsynaptic currents (mEPSCs, Fig. 4e, f), whereas 
elevating the level of Npas4 with the Npas4-minigene significantly 
increased the inter-event interval and decreased the amplitude of 
mEPSCs (Fig. 4g, h), compared with control-transfected neurons. 
Thus, abolishing Npas4 expression in a pyramidal neuron increases, 
whereas enhancing the level of Npas4 decreases, the number and/or 
presynaptic release probability of excitatory synapses that form on 
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Figure 3 | Npas4 regulates GABAergic synapse development in 
organotypic hippocampal slices. a, Representative mIPSCs recorded from 
CA1 pyramidal neurons in organotypic hippocampal slices biolistically co- 
transfected with GFP and either vector control, Npas4-RNAi or Npas4- 
minigene. b, Cumulative distributions of mIPSC inter-event intervals and 
amplitudes recorded from neurons transfected with vector control, Npas4- 
RNAi or Npas4-minigene. c, Mean = s.e.m. of data from b. mIPSC inter- 
event intervals: 2986.3 + 105.7, 3803.0 + 136.9 and 1776.9 + 75.1 ms; 
amplitudes: 31.5 = 1.1, 25.7 + 0.8 and 34.1 + 1.3 pA; for vector control, 
Npas4-RNAi and Npas4-minigene, respectively. d, Cumulative distributions 
of mIPSC inter-event intervals and amplitudes recorded from Npas4™"™* 
neurons co-transfected with GFP and either vector control or Cre 
recombinase. e, Mean + s.e.m. of data from d. mIPSC inter-event intervals: 
684.2 + 31.2 and 917.4 + 37.7 ms; amplitudes: 27.5 + 0.7 and 27.9 + 0.6 pA; 
for vector control and Cre, respectively. Total numbers of neurons analysed 
in each condition (1) are indicated in ¢c and e. *P < 0.05; ***P < 0.001. 
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the neuron. Therefore, the net result of Npas4 activation in an intact 
neural circuit is an increase in synaptic inhibition and a decrease in 
excitation of a neuron. We conclude that Npas4 induction in res- 
ponse to increased excitatory input acts to reduce the level of activity, 
and therefore may function as a negative feedback mechanism to 
maintain the homeostatic balance between excitation and inhibition. 


Npas4 regulates genes that control inhibitory synapse 
development 

To uncover the program of gene expression controlled by Npas4, we 
acutely knocked down Npas4 expression in a high percentage of wild- 
type neurons using a lentivirus expressing Npas4-RNAi 
(Supplementary Fig. 7) and performed a second DNA microarray 
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Figure 4 | Npas4 has no effect on excitatory synaptogenesis but affects 
excitatory/inhibitory balance in neural circuits. a, The number of 
excitatory synapses is not affected by Npas4-RNAi. Quantification of the 
normalized density of co-localized PSD95 and synapsin! puncta in 14 DIV 
rat hippocampal neurons transfected with vector control, Npas4-RNAi or 
control-RNAi. b, Separate quantification of PSD95 and synapsin1 puncta 
measured in a. c, Npas4-minigene has no effect on the density of excitatory 
synapses. Quantification of co-localized synapsin1 and PSD95 puncta is 
shown. d, Separate quantification of PSD95 and synapsin1 puncta measured 
in c. In a—d, data are presented as mean + s.e.m. e, Cumulative distribution 
of mEPSC inter-event intervals and amplitudes recorded from Npas4™"™ 
neurons co-transfected with GFP and either vector control or Cre. 

f, Mean + s.e.m. of data from e. mEPSC inter-event intervals: 

2581.4 + 104.1 ms and 2140.5 + 79.7 ms; amplitudes: 12.1 + 0.4 and 

12.4 + 0.3 pA; for vector control and Cre, respectively. g, Cumulative 
distribution of mEPSC inter-event intervals and amplitudes recorded from 
neurons transfected with GFP and either vector control or Npas4-minigene. 
h, Mean + s.e.m. of data from g. mEPSC inter-event intervals: 2686.4 + 89.3 
and 3320.7 + 117.2 ms; amplitudes: 14.3 = 0.3 and 12.0 + 0.3 pA; for vector 
control and Npas4-minigene, respectively. Total numbers of neurons 
analysed in each condition (1) are indicated. a, b, Three independent 
experiments; ¢, d, four independent experiments. *P < 0.05; ***P < 0.001. 
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experiment to identify activity-regulated genes that are misregulated 
in the absence of Npas4. The expression levels of 327 microarray 
probe sets representing 270 unique genes were significantly different 
in cultures expressing Npas4-RNAi compared with those expressing 
the control virus (Gene Expression Omnibus accession number 
GSE11258, Supplementary List). The expression of 182 of these genes 
was also acutely regulated by membrane depolarization in the 
absence of Npas4-RNAi, indicating that many putative Npas4- 
regulated genes are activity regulated (Fig. 5a). Although Npas4 has 
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Figure 5 | Npas4 controls a program of gene expression that regulates 
GABAergic synapses. a, Hierarchical clustering of 327 probe sets (270 
putative Npas4 target genes) based on their expression profiles using 
dChip”’. The expression level of each probe set is normalized to a mean of 0 
and a standard deviation of 1. Expression values are displayed within the 
range [—3, 3] with levels above, equal to or below the mean displayed in red, 
white and blue, respectively. Dark red represents 3 or higher, and dark blue 
—3 or lower. b, Biological functions of 270 putative Npas4 target genes based 
on Gene Ontology information provided by Affymetrix (http:// 
www.affymetrix.com). ¢, BDNF expression is reduced by Npas4-RNAi 

(1.9 + 0.11-fold reduction, P < 0.01, one-tailed paired t-test). Mean = s.e.m. 
from three independent experiments is shown, and each data point was 
generated by averaging the hybridization intensity of two BDNF probe sets 
(1422168_a_at and 1422169_a_at). d, BDNF levels are consistently reduced 
in neurons from Npas4 /~ mice compared with their wild-type littermates 
(58.46 + 6.49% decrease, 95% confidence interval 30.5-86.4%). Cortical 
cultures prepared from Npas4*’* and Npas4 /~ littermates (7 DIV) were 
stimulated with KCl (55 mM), and BDNF mRNA levels were measured by 
quantitative reverse transcriptase PCR using primers in BDNF coding 
region. Three littermate pairs from three different litters were analysed; data 
(mean = s.e.m.) from one representative pair is shown. e, Npas4 interacts 
directly with BDNF promoters I and IV in an activity-dependent manner, as 
shown by chromatin immunoprecipitation. The Npas4 promoter is used as a 
positive control. Data are normalized to a control region on chromosome 3 
and are presented as mean = s.e.m. from five independent experiments. 
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been shown to function as a transcriptional activator 
(Supplementary Fig. 6b)**, we found that many genes are negatively 
regulated by Npas4 (Fig. 5a). This may reflect the fact that Npas4 
functions as a transcriptional repressor as well as an activator, and/or 
that Npas4 indirectly affects gene expression by altering neuronal 
excitation. 

Npas4 appears to regulate a wide variety of genes, such as activity- 
regulated immediate early genes, various classes of transcription fac- 
tors, channel proteins, G-protein signalling molecules, kinases and 
phosphatases, and genes involved in pathways that modulate syn- 
aptic functions, such as ubiquitination, trafficking and receptor 
endocytosis (Fig. 5b and Supplementary List). Interestingly, the func- 
tions of 94 of the 270 putative Npas4-regulated genes are uncharac- 
terized (Fig. 5b), suggesting that Npas4 regulates many genes that 
could affect inhibitory synapses in novel ways. 

As a first step towards understanding the genetic program regu- 
lated by Npas4, we focused on targets that might be directly involved 
in the development of GABAergic synapses. Brain-derived neuro- 
trophic factor (BDNF) stood out because it had previously been 
shown to regulate GABAergic synapse maturation and func- 
tion'*°?”~°, BDNF expression is consistently reduced by almost two- 
fold in cultures expressing Npas4-RNAi compared with control 
cultures (Fig. 5c). Primary cultures from Npas4 ‘~ mice showed a 
similar decrease in depolarization-induced BDNF expression com- 
pared with their wild-type littermates (Fig. 5d). 

The BDNF gene has many promoters and the activity-dependent 
BDNF mRNA transcripts are controlled by promoters I and IV’. 
Using a chromatin immunoprecipitation assay, we asked whether 
Npas4 binds to these promoters in primary cortical cultures that were 
left untreated or membrane-depolarized to trigger Npas4 expression. 
Although no Npas4 binding was detected in untreated neurons, in 
membrane-depolarized neurons Npas4 was found to be associated 
with BDNF promoters I and IV, but not with the coding region or 3’ 
untranslated region (Fig. 5e), suggesting that Npas4 may directly 
regulate activity-dependent expression of BDNF. 

To determine whether BDNF contributes to the effects of Npas4 
on GABAergic synapse development, we tested whether the ability of 
the Npas4-minigene to increase the number of inhibitory synapses is 
attenuated by knockdown of BDNF expression, using a previously 
validated small hairpin RNA that targets the BDNF coding region 
(BDNF-RNAi)*. Confirming that BDNF increases inhibitory syn- 
apse number**, BDNF-RNAi increased mIPSC inter-event inter- 
vals compared with control values (2986.3 + 109.7 and 4086.1 + 
140.7ms, control and BDNF-RNAi, respectively, P<0.01). 
Expression of the Npas4-minigene alone led to an approximately 
40% decrease in the inter-event interval of mIPSCs recorded from 
CA1 neurons (Fig. 3b, c), as previously observed, but this was partly 
attenuated to approximately 20% by the presence of BDNF-RNAi 
(Fig. 6a). In addition, in the presence of BDNF-RNAi, the effect of the 
Npas4-minigene on the amplitude of mIPSCs was completely 
reversed (Fig. 6b). These findings suggest that BDNF mediates a 
portion of the effect of Npas4 on inhibitory synapse number, but 
that additional Npas4 targets may also be involved. 


Discussion 


We have identified the activity-regulated transcription factor Npas4 
as a key regulator of GABAergic synapse development. Excitatory 
synaptic activity induces Npas4 in a Ca**-dependent manner, and 
the level of Npas4 determines the number of functional GABAergic 
synapses by controlling a program of activity-dependent gene 
expression. Future characterization of Npas4 target genes will help 
to determine whether Npas4 acts by initiating inhibitory synapse 
formation, stabilizing nascent inhibitory synapses or promoting 
the maturation of weak inhibitory synapses. It is possible that differ- 
ent subsets of Npas4 targets control the development of GABAergic 
synapses formed by distinct classes of interneurons, providing a 
mechanism for the independent regulation of inhibition received 
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Figure 6 | Knockdown of BDNF partially attenuates the ability of the 
Npas4-minigene to elevate GABAergic synapses. a, Cumulative 
distributions (left) and mean = s.e.m. (right) of mIPSC inter-event intervals 
in neurons transfected with BDNF-RNAi (4086.1 + 140.7 ms) or BDNF- 
RNAi + Npas4-minigene (3257.9 + 117.7 ms). b, Cumulative distributions 
(left) and mean = s.e.m. (30.1 + 0.9 and 27.1 + 0.7 pA, BDNF-RNAi versus 
BDNF-RNAi + Npas4-minigene, right) of mIPSC amplitudes. Total 
numbers of neurons analysed in each condition (n) are indicated. 

***D < 0,001. 


by subregions of a neuron. Another intriguing possibility is that 
Npas4 controls experience-dependent developmental processes, 
such as critical period plasticity in the visual cortex, which depend 
on the function and maturation of GABAergic synapses. 

Although Npasé4 is not required for the initial formation of excit- 
atory synapses, activation of Npas4 in excitatory neurons within a 
neural circuit appears to diminish the excitatory synaptic input they 
receive. It is not known whether this effect is mediated directly by 
Npasé or is an indirect consequence of changes in inhibitory input. In 
either case, our findings suggest that Npas4 functions as part of the 
homeostatic mechanism that stabilizes the activity of a neuron in the 
face of changing glutamatergic input. Further investigation of the 
function of Npas4 and other activity-dependent regulators of inhib- 
itory synapses” will provide insight into the mechanism by which 
neuronal activity controls the balance between excitation and inhibi- 
tion in the brain, and how the disruption of this balance leads to 
neurological disorders such as autism, epilepsy and schizophrenia. 


METHODS SUMMARY 

Dissociated neuron culture and transfection. Dissociated cortical and hippo- 
campal neurons were prepared from E18 rat or E16 mouse embryos as previously 
described**. Cultures were maintained in Neurobasal Medium supplemented 
with B27 (Invitrogen), penicillin-streptomycin and glutamine. Neurons were 
plated at 100,000-150,000 per well in a 24-well plate or 15,000,000 per 10-cm 
plate. For synapse assays, hippocampal neurons (60,000 per well, 24-well plate) 
were plated on a monolayer of astrocytes’’. Neurons were transfected at 5-6 days 
in vitro (DIV) using the calcium phosphate method”. 

Organotypic slice culture and transfection. Organotypic hippocampal slice 
cultures were prepared from P6 rats or mice as previously described“. Slices 
were biolistically transfected with a Helios Gene Gun (Biorad) after 2 days. 
Bullets for the gene gun were 1.6-\1m gold particles coated with 15 ug eGFP 
and either 5 or 101g RNAi construct, 45 ug Npas4-minigene or 301g Cre. 
Empty plasmid was added to bring the total DNA to 60 ug in each case. 
Synapse density assay. Hippocampal neurons (14-18 DIV) were immuno- 
stained for synaptic markers and imaged on a Zeiss LSM5 Pascal microscope 
using a X63 objective lens. Image acquisition and synapse quantification were 
performed in a blinded manner. Glutamatergic neurons were identified based on 
morphology and the absence of cytosolic GAD65 staining. Synapse density was 
measured using Metamorph software as previously described*’. Within each 
experiment, density of puncta was normalized against the control, and the error 
of the control propagated into each experimental condition. Statistical analysis 
was performed on the raw data (without normalization) using multifactorial 
analysis of variance in StatView 4.5 (Abacus Concepts). 

Electrophysiology. Whole-cell patch clamp recordings were made at room tem- 
perature from CA1 pyramidal neurons 7-10 days after transfection. Data were 
analysed in IgorPro (Wavemetrics) using custom-written macros. Statistical 
significance was determined by Kolmogorov—Smirnov test and Monte Carlo 
simulation. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

DNA constructs. Npas4-RNAi, 5’-GGTTGACCCTGATAATTTA-3’, and con- 
trol-RNAi with a scrambled sequence (underlined), 5’-GGTTCAGCGTCATAA 
TTTA-3’, were cloned into the pSuper expression vector (OligoEngine). The 
same Npas4-RNAi was used to generate the lentivirus construct (Cellogenetics, 
Inc). The control lentivirus construct, pLenti-shGL3 (Cellogenetics, Inc), targets 
the luciferase gene, which is not expressed by mouse hippocampal neurons. 

The Npas4-minigene was generated by subcloning the mouse genomic region 
from CTCGGTTTCTATCTTCATGCC to GCCATGTGGCCTGCCGGTAC into 
a pBluescriptlIk + vector between the SacI and KpnI digestion sites. The 
sequence targeted by Npas4-RNAi was mutated to AgtCgaTccCgataattta, preserv- 
ing the amino-acid sequence, to generate an RNAi-resistant Npas4-minigene. 

The Npas¢4 luciferase reporter was constructed by replacing the tandem MEF2 

responsive elements with three copies of the Npas4 responsive element” in the 
3X MRE-luc plasmid”. 
Immunocytochemistry and immunohistochemistry. Cultured neurons were 
fixed and immunostained as previously described*’. For immunohistochemistry, 
mice were perfused intracardially with cold PBS followed by 4% paraformalde- 
hyde in PBS. Brains were post-fixed in 4% paraformaldehyde in PBS at 4°C 
overnight followed by incubation in 30% sucrose in PBS at 4 °C for 24h. These 
cryo-protected brains were immediately sectioned on a cryostat (Leica) or stored 
in Tissue-Tec O.C.T. at —80 °C for later sectioning. Matching brain sections 
(35 um) were incubated in blocking solution (3% BSA, 3% goat serum, 0.3% 
Triton X-100, 0.2% Tween-20 in PBS) at room temperature for 2h. 
Subsequently, sections were incubated with primary antibodies overnight at 
4°C and with secondary antibodies at room temperature for 2 h, both in block- 
ing solution. Finally, sections were mounted on slides with Aquamount (Lerner 
Laboratories). 

The following antibodies were used: GAD65 (mouse, 1:1,000, Chemicon), 

GABAa-receptor y2 subunit (rabbit, 1:400, Chemicon), PSD95 (mouse, 1:300, 
Affinity BioReagents), synapsin I (rabbit, 1:500, Chemicon), c-fos (rabbit, 
1:1,000, Santa Cruz sc-52), phosphor-serine 133 CREB (mouse, 1:2,000, 
Upstate). The Npas4 antibody (rabbit, 1:2,000) was raised against a carboxy- 
terminal region of Npas4 (amino acids 597-802) and its specificity was tested in 
Npas4 /~ mice (Supplementary Fig. 2). 
Synapse density assay. Confocal image acquisition and synapse density mea- 
surement were performed as previously described*’. The perisomatic region of a 
neuron was defined as the cell body plus a 20-pixel circumference. Each pixel is 
approximately 0.143 um. The perisomatic region was excluded from the excit- 
atory synapse density analysis. 

Because synapse density and immunostaining vary significantly between 
experiments, it is necessary to normalize each experiment before combining 
them. Normalization and error propagation were performed as previously 
described”. 

Electrophysiology. Slices were perfused with artificial cerebrospinal fluid con- 
taining (inmM) 127 NaCl, 25 NaHCOs, 1.25 Na,HPOg, 2.5 KCl, 2 CaCh, 1 
MgCl, 25 glucose, and saturated with 95% O., 5% CO». The internal solution 
for mIPSCs contained (in mM) 147 CsCl, 5 Naz-phosphocreatine, 10 HEPES, 2 
MgATP, 0.3 Na,GTP and 1 EGTA. The internal solution for mEPSCs contained 
(inmM) 120 cesium methane sulfonate, 10 HEPES, 4 MgCl, 4 Na,ATP, 0.4 
Na,GTP, 10 sodium phosphocreatine and 1 EGTA. Osmolarity and pH were 
adjusted to 300 mOsm and 7.3 with Millipore water and CsOH, respectively. 
mIPSCs were pharmacologically isolated by bath application of (in 1M) 0.5 
tetrodotoxin, 10 (R)-CPP and 10 NBQX disodium salt (all from Tocris 
Bioscience); mEPSCs were isolated with 0.5 tetrodotoxin and 50 picrotoxin 
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(Tocris Bioscience), and augmented with 10 cyclothiazide. Cells with series 
resistance larger than 25 MQ during the recordings were discarded. 

Data were analysed in IgorPro (Wavemetrics) using custom-written macros. 
For each trace, the event threshold was set at 1.5 times the root-mean-square 
current. Currents were counted as events if they crossed the event threshold, had 
a rapid rise time (1.5 pA ms_') and had an exponential decay (2 <1 < 200 ms, 
1<t<50ms for mIPSC and mEPSC, respectively). 

Statistical significance was determined by two methods. First, 50 random 

points selected from each cell were concatenated to describe the cumulative 
distributions of events in each condition and then compared by a 
Kolmogorov-Smirnov test. Second, a Monte Carlo simulation was performed 
in which points were randomly sampled from each condition and the mean of 
these samples compared at least 1,000 times. P< 0.05 from both tests was con- 
sidered significant. 
DNA microarrays and data analysis. For all microarray experiments, total RNA 
was purified using RNeasy mini kits (Qiagen) and biotin-labelled CRNA was 
generated following Affymetrix standard protocols. Ten micrograms of the 
labelled CRNA was hybridized to Affymetrix mouse MOE430 arrays. 

To identify activity-regulated genes, PO mouse cortical neurons were cultured 
for 7 days and then depolarized with 50mM KC] for 1 and 6h. To identify 
Npas4-regulated genes, E16 mouse hippocampal neurons were infected 
(3 DIV) with Npas4-RNAi or control lentivirus, depolarized (8 DIV) with 
50mM KCI, and total RNA was collected 0, 1, 3 and 6h later. Drebrin, an 
Npas4 target previously identified in a neuroblastoma cell line* was not affected 
by Npas4-RNAi in post-mitotic neurons. 

The DNA-Chip (dChip) software package*’ was used to analyse the micro- 

array data. Genes were considered candidates if they met the following criteria: 
(1) there was at least a 1.5-fold difference between the experimental and control 
conditions (depolarized versus non-depolarized, or Npas4-RNAi virus versus 
control virus) at any of the time points; and (2) there was an absolute difference 
of more than 100 normalized hybridization intensity units in the expression level 
between the experimental and control samples. 
Chromatin immunoprecipitation. Rat cortical neurons were treated with 1 1M 
tetrodotoxin and 100 1M AP5 overnight and then stimulated with 55 mM KCI 
for 2h at 8 DIV. Chromatin immunoprecipitation was performed as previously 
described”’. The following primers were used for quantitative PCR: Chr3, for- 
ward: 5'-GACCCCATCCTGTGGTTATG-3’, reverse: 5’-GCAACAAAGGCA 
AATGGAAT-3'; Npas4, forward: 5’-CAGGATGACTCACACTGACAGTATT 
TTTAG-3’, reverse: 5’-GTGGGAGAAGAGCTATTTATATCACCAG-3'; BDNF 
promoter I, forward: 5'-GITGCCTCTCGCCTAGTCATC-3’, reverse: 5'-AGGG 
AACAACTGCGTGAATC-3’; BDNF promoter IV, forward: 5’-CAGGGAGC 
TACTCACCCAAC-3’, reverse: 5’-GCACACAGAAGCCAAACCTT-3'; BDNF 
coding region, forward: 5'-GACAAGGCAACTTGGCCTAC-3’, reverse: 5'-TC 
GTCAGACCTCTCGAACCT-3’; BDNF 3’ untranslated region, forward: 5’-GA 
AACGCCACACCTACGAAT-3’, reverse: 5’'-GGCACCGAATCCAATTCTAA-3’. 
Quantitative reverse transcriptase PCR. The mRNA level of the gene of interest 
was normalized against the mRNA level of B-tubulin in the same sample. The 
following primers were used: Nopas4, forward: 5’-GCTATA 
CTCAGAAGGTCCAGAAGGC-3’, reverse: 5'-TCAGAGAATGAGGGTAGCA 
CAGC-3'; BDNF, forward: 5'’-GATGCCGCAAACATGTCTATGA-3’, reverse: 
5'-TAATACTGTCACACACGCTCAGCTC-3’; B-tubulin, forward: 5’-CGAC 
AATGAAGCCCTCTACGAC-3’, reverse: 5’-ATGGTGGCAGACACAAGGTGG 
TTG-3’. 


50. Flavell, S. W. et al. Activity-dependent regulation of MEF2 transcription factors 
suppresses excitatory synapse number. Science 311, 1008-1012 (2006). 
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Glycogen synthase kinase 3 in MLL 
leukaemia maintenance and 
targeted therapy 


Zhong Wang’, Kevin S. Smith’, Mark Murphy’, Obdulio Piloto', Tim C. P. Somervaille' & Michael L. Cleary’ 


Glycogen synthase kinase 3 (GSK3) is a multifunctional serine/threonine kinase that participates in numerous signalling 
pathways involved in diverse physiological processes. Several of these pathways are implicated in disease pathogenesis, 
which has prompted efforts to develop GSK3-specific inhibitors for therapeutic applications. However, before now, there has 
been no strong rationale for targeting GSK3 in malignancies. Here we report pharmacological, physiological and genetic 
studies that demonstrate an oncogenic requirement for GSK3 in the maintenance of a specific subtype of poor prognosis 
human leukaemia, genetically defined by mutations of the MLL proto-oncogene. In contrast to its previously characterized 
roles in suppression of neoplasia-associated signalling pathways, GSK3 paradoxically supports MLL leukaemia cell 
proliferation and transformation by a mechanism that ultimately involves destabilization of the cyclin-dependent kinase 
inhibitor p27“"". Inhibition of GSK3 in a preclinical murine model of MLL leukaemia provides promising evidence of efficacy 


and earmarks GSK3 as a candidate cancer drug target. 


GSK3 is a serine/threonine kinase that functions in numerous signal- 
ling pathways initiated by diverse stimuli’. Originally studied for its 
role in glycogen metabolism and insulin action, GSK3 has subse- 
quently been shown to have central functions in many cellular and 
physiological processes including transcription, cell cycle division, 
apoptosis, cell fate determination and stem cell maintenance, among 
others’ *. GSK3 is constitutively active in resting cells, showing a pref- 
erence for primed substrates’, and is functionally inactivated after 
phosphorylation by various kinases in response to different stimuli*”. 
Given its various contributions and the diversity of putative sub- 
strates, many levels of regulation help confer GSK3 signalling specifi- 
city, which varies among cell types and their states of differentiation. 

GSK3 functions in several pathways implicated in human diseases, 
which has prompted efforts to develop specific inhibitors for thera- 
peutic applications. GSK3 facilitates non-insulin-dependent diabetes 
by the inactivation of glycogen synthase*®, and may have a role in 
promoting various inflammatory processes through the activation of 
the transcription factor nuclear factor-«B by, at present, undefined 
mechanisms’*. GSK3-mediated hyperphosphorylation of tau (also 
known as MAPT), a component of neurofibrillary tangles, may facil- 
itate Alzheimer’s disease and other neurodegenerative disorders’. In 
cancer cells, however, signalling pathways that are normally sup- 
pressed by GSK3—such as Wnt and Hedgehog, which are involved 
in embryonic cell fate determination and normal stem cell mainten- 
ance—are aberrantly activated’*'*. This underscores the normal role 
of GSK3 in mediating phosphorylation of substrates such as B-cate- 
nin (Wnt signalling), MYCN (Hedgehog signalling) and JUN, which 
leads to their destruction and/or inactivation, thus inhibiting signals 
that otherwise promote proliferation and __ self-renewal'*° 
(Supplementary Fig. 1). Consistent with these molecular functions, 
GSK3 inhibition significantly enhances maintenance of embryonic 
stem cell pluripotency and haematopoietic stem cell repopulation 
after bone marrow transplantation’”"’, although the specific path- 
ways for these effects remain undefined. Despite its inhibitory roles in 


pathways implicated in cancer pathogenesis, there has so far been no 
compelling rationale for the targeting of GSK3 as a therapeutic 
approach in malignancies. Here we demonstrate a paradoxical and 
unexpected role for GSK3 in cancer maintenance, and we establish 
GSK3 as a potential selective therapeutic target in a genetically 
distinctive and poor prognosis subset of acute leukaemia. 


GSK3 inhibition induces G1 arrest of MLL leukaemia cells 


A small-scale screen was conducted to identify compounds that spe- 
cifically blocked the growth of genetically defined subsets of leuk- 
aemia cells. Thirty compounds (Supplementary Table 1) that target 
principal kinases or other enzymes were screened for differential 
dose-responses in various cell lines (Supplementary Table 2). These 
cell lines represent human leukaemias harbouring a variety of 
chromosomal translocations that create distinctive chimaeric fusion 
proteins implicated in disease pathogenesis. The leukaemia cell lines 
were comparably sensitive to most of the tested compounds (data not 
shown). However, cell lines that expressed MLL—AF4 or MLL-AF5, 
the highly related fusion oncogenes created by t(4;11) or t(5;11) 
chromosomal translocations, respectively, showed enhanced sens- 
itivity to GSK3-IX, a GSK3 inhibitor that also targets cyclin-depend- 
ent kinases (CDKs; Fig. 1a; for clarity, only two representative control 
cell lines are shown). Their proliferation was inhibited at a half-max- 
imal inhibitory concentration (ICs9) of 0.3—2 UM, a concentration 
range comparable to that which promotes expansion of haemato- 
poietic stem cells in vitro'*, but tenfold lower than the toxicity levels 
for non- MLL leukaemia cell lines (Fig. 1a) and normal bone marrow 
progenitors (see later). In contrast, the CDK inhibitors roscovitine 
(Fig. 1a), flavopiridol and olomoucine (data not shown) had similar 
ICsp values for all cell lines, suggesting that the inhibitory effects of 
GSK3-IX on MLL cell lines resulted from GSK3, not CDK, blockade. 
Further studies with SB216763 (a widely used maleimide-containing 
GSK3 inhibitor with a relatively higher ICs) than GSK3-IX) and with 
alsterpaullone (which has a similar inhibition profile as GSK3-IX) 


'Department of Pathology, Stanford University School of Medicine, Stanford, California 94305, USA. 
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confirmed that MLL leukaemia cells were differentially sensitive to 
GSK3 inhibition (Fig. 1b and data not shown). Increased f-catenin 
levels correlated with effective GSK3 inhibition, which did not alter 
MLL oncoprotein abundance or function (Supplementary Fig. 2a, b). 

Cell cycle analyses showed a marked reduction in GI-S phase 
progression of MLL leukaemia cells after 24h of inhibitor treatment, 
whereas non-MLL leukaemia cells were only minimally affected 
(Fig. 1c). More prolonged incubation with inhibitor (6 days) was 
associated with cell death, as evidenced by a substantial increase in 
sub-G0/G1 DNA content (Supplementary Fig. 3a, b). These data 
suggest that GSK3, which is constitutively active in normal resting 
cells, paradoxically supports the proliferation and sustained survival 
of a genetically defined subset of leukaemia. 


GSK3 dependence is a general feature of MLL transformed cells 


Murine transformation models were used to characterize the role of 
GSK3 in MLL leukaemia further. Transduction of MLL oncogenes 
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Figure 1| Sensitivity of MLL leukaemia cell lines to GSK3 inhibition. a, The 
growth of human leukaemia cell lines was assessed after 3 days culture in the 
presence of the indicated concentrations of GSK3-IX (left panel) or 
roscovitine (right panel). The results are expressed as the cell numbers 
relative to those without drug treatment, and represent the mean of three 
independent experiments (+ s.e.m.). MTT, 3-(4,5-dimethylthiazol-2-yl)- 
2,5-diphenyltetrazolium bromide. b, Human leukaemia cell lines were 
cultured in the absence or presence of 10 14M SB216763 for 2 days. The 
results of a representative experiment are expressed as the fold change in cell 
number compared to day 0 (+ s.e.m., n = 3). ¢, Human leukaemia cell lines 
were cultured in the presence of 5 1M SB216763 for 24h, and BrdU 
incorporation was quantified by fluorescence-activated cell sorting (FACS) 
analysis. 
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into primary murine myeloid progenitors induces aberrant Hox gene 
expression'’”°, leading to enhanced self-renewal in vitro and acute 
myeloid leukaemias in vivo that accurately model the features of 
human MLL leukaemia*' without altering GSK3 levels or activity 
(Supplementary Fig. 3c, d). Culture of MLL-transduced myeloid 
progenitors with a GSK3 inhibitor reduced their clonogenic poten- 
tials and proliferation (Fig. 2a). This contrasted with progenitors 
immortalized by other fusion oncogenes (Fig. 2a and data not 
shown), which showed no adverse growth effects with 104M 
SB216763 treatment, as was also the case for primary myeloid pro- 
genitors (Supplementary Fig. 4). Inhibition of GSK3 primarily 
resulted in proliferative arrest of MLL transformed cells, but pro- 
longed exposure induced morphological features of myeloid differ- 
entiation (Fig. 2b, c) and reduced expression of c-Kit (also known as 
KIT) (data not shown), a phenotypic marker of normal progenitors 
and MLL leukaemia stem cells. Mouse B cell progenitors trans- 
formed by MLL-AF4, but not by other oncogenes, also showed mark- 
edly reduced proliferation in 10 uM SB216763 (Fig. 2d). These data 
suggest that GSK3 dependence may be a primary consequence and 
general feature of MLL transformation in several haematopoietic 
lineages. Furthermore, expression of a constitutively active mutant 
of the protein kinase AKT, which phosphorylates GSK3 and nega- 
tively regulates its kinase activity, resulted in suppression of cell 
growth and clonogenic potentials of mouse myeloid and B cell pro- 
genitors transformed by MLL oncogenes (Supplementary Fig. 5), 
providing support that MLL transformed cells are dependent on 
GSK3 for continued proliferation and maintenance of their trans- 
formed phenotypes in vitro. 
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Figure 2 | Sensitivity of MLL-transformed mouse B and myeloid 
progenitors to GSK3 inhibition. a, The growth of myeloid progenitors 
transformed by various oncogenes was assessed after 3 days of culture in the 
presence or absence of a GSK3 inhibitor. The results of a representative 
experiment are expressed as the fold change in cell number compared to 
day 0 (+ s.e.m., n = 3). b, c, The morphological features of MLL—ENL 
transformed myeloid progenitors were assessed after 4 days of culture in the 
presence or absence of GSK3 inhibitor. Original magnification, < 40. The bar 
graph (c) indicates the mean number of cells with the indicated 
morphological features (n = 3). d, The growth of B lymphoid progenitors 
transformed by E2A—HLF and BCL2 (BiEH)*’, TEL-AML1 or MLL-AF4 
oncogenes was assessed after 3 days of culture in the absence or presence 
(5 uM or 10 1M) of SB216763. The results of a representative experiment are 
expressed as the fold change in cell number compared to day 0 (= s.e.m., 
n= 3). 
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GSK3a/$ isoforms cooperatively maintain MLL transformation 


To investigate further the GSK3 requirement, myeloid progenitors 
were isolated from fetal livers of Gsk3b-’~ mice (embryonic day 
(E)16 embryos), transduced with retroviral vectors encoding MLL 
or unrelated oncogenes (Fig. 3a), and then serially replated in methyl- 
cellulose culture to assess their self-renewal properties. Gsk3b” /~ 
cells were capable of sustaining the enhanced self-renewal typically 
induced by MLL oncogenes, and did not show reduced clonogenic 
potentials compared with wild-type cells transduced with the same 
MLL oncogenes (Supplementary Fig. 6) despite a 50% reduction in 
overall GSK3 activity levels (Supplementary Fig. 7d). Thus, GSK3-B 
was not required to initiate MLL transformation in vitro. However, 
MLL-transformed Gsk3b-’~ cells showed markedly increased sens- 
itivity to pharmacological GSK3 inhibition (Fig. 3b, c), which was 
reversed by the forced expression of exogenous GSK3-f (Fig. 3d). In 
contrast, Gsk3b’~ cells transformed by other fusion oncogenes 
(Fig. 3b and data not shown) were unaffected by a several fold higher 
concentration of inhibitor. Thus, genetic reduction of GSK3-B levels, 
by knockout or short-hairpin-RNA-mediated knockdown (Supple- 
mentary Fig. 7a), in MLL-transformed myeloid progenitors resulted 
in increased sensitivity to pharmacological GSK3 inhibition. 
Persistence of the transformed phenotype but enhanced inhibitor 
sensitivity in the absence of GSK3-f suggested that the two GSK3 iso- 
forms probably have redundant roles in MLL transformation. Thus, 
GSK3-a knockdown studies were performed in myeloid progenitors, 
which resulted in efficient reduction of GSK3-« protein levels in wild- 
type as well as Gsk3b/~ cells, (Fig. 4a) accompanied by further 
decrease in total GSK3 activity to less than 20% of wild- 
type cells (Supplementary Fig. 7d). Unlike GSK3-8 knockout or 
knockdown cells, MLL-transformed cells deficient for GSK3-« 
(Gsk3a*”) did not show differences in growth or heightened sensitivity 
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Figure 3 | Genetic ablation of GSK3-f hypersensitizes MLL-transformed 
cells to pharmacological GSK3 inhibition. a, Western blot analysis 
demonstrates the amounts of GSK3 protein isoforms in wild type or 
Gsk3b ’~ myeloid progenitors transformed by the indicated oncogenes. 

b, Wild type (+/+) or Gsk3b /~ (—/—) myeloid progenitors transformed by 
various oncogenes were incubated in the presence of the indicated 
concentrations of SB216763. Cell numbers were enumerated on day 2 and 
expressed as the fold change compared to day 0 (= s.e.m. of triplicate 
analyses). c, Western blot analysis demonstrates the relative amounts of 
B-catenin after treatment with the indicated concentrations (uM) of 
inhibitor in wild type (WT) or Gsk3b ’~ myeloid progenitors transformed 
by MLL-ENL. d, Gsk3b ’~ myeloid progenitors transformed by MLL 
oncogenes were stably transduced with Flag—GSK3- (+) or vector (—), and 
then incubated in the presence or absence of 5 1M SB216763. Cell numbers 
were enumerated on day 2 and expressed as the fold change compared to 
day 0 (+ s.e.m. of triplicate analyses). 
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to GSK3 inhibitors (Supplementary Fig. 7a, b). However, myeloid 
progenitors deficient for both GSK3 isoforms (Gsk3b‘~ Gsk3a*”) 
showed a marked impairment in clonogenicity and proliferation com- 
pared to wild-type cells transformed by MLL oncogenes (Fig. 4b, c), 
and were unable to sustain long-term growth in culture. In contrast, 
the growth of cells transformed by other leukaemia oncogenes was 
unaffected by the compound deficiency of both GSK3 isoforms 
(Fig. 4b, c) despite substantially decreased GSK3 activity (Supple- 
mentary Fig. 7d). Gsk3b’~ Gsk3a*” cells transduced with MLL onco- 
genes also hada more differentiated myeloid morphology (Fig. 4d) and 
phenotype (not shown). Notably, MLL-ENL transformed cells lacking 
both GSK3 isoforms were unable to induce leukaemia in transplanted 
mice (Fig. 4e). Thus, GSK3 isoforms cooperatively maintain critical 
features of the MLL transformed phenotype, although GSK3-f serves a 
predominant role. 


Efficacy of GSK3 inhibition in a mouse model of MLL leukaemia 
A mouse model of MLL-associated leukaemia was used to assess 
whether GSK3 inhibition in vivo would affect the course of disease. 


a MLL-ENL E2A-HLF e = He aa pe 
WT Gsk3b— _ — uae 
Gsk3a shRNA: - + + - - - + - + om 
GSK3 =e 
—= he 0. 
see OF e8k3-p 50 
kDa p25 
62 - eee ee | Tubulin 0) 
4s-\— _ 0 25 50 75 100125 150 
Time after transplant (days) 
bs wVector m Gsk3a shRNA D Macrophagem Maturing f ) 1mM 
=120 Neutrophil gg Blast B2mMma4mM LiCl 
2 100 120 
5 100 » 80 » = 100 
2 80 S 5 L a 
S £ 60 2e 
S60 | 5 5 8 60 
8 40 S$ 40 = xe) 40 
c o 20 aS 
S 20 OS 40 
[o} 
O90 — T 0 u mu / 0 
MLL-ENL MLL-GAS7 E2A-HLF WT GI CSIo MLL-AF4 BiEH 
Gsk3b~“ cells Gskaa 
c fe g Regular chow 
ffs /- KD oie je} 
we ~~ WT -s Gsk3b--- Gsk3b~“ Gsk3a' =°5 dum chow 
>“) MLL-ENL NUP98-HOXAQ 
$10 100 
ce) S 
3 8 75 
Xe} <1 
=e 250 
® < 
a4 3 
= D 25 
22 
30 ° > qo 20 30 40 50 
e) 0 Ww 23 0 i 6268 
Time (days) Time after transplant (days) 


Figure 4 | Compound genetic deficiency of GSK3-a. and GSK3-f impairs the 
growth and leukemogenicity of MLL-transformed cells. a, Western blot 
analysis was performed on wild type (WT) and Gsk3b ’ myeloid 
progenitors transformed by the indicated oncogenes (top) and transduced 
by lentiviral vectors lacking (—) or expressing (+) Gsk3ashRNA. b, Myeloid 
progenitors transformed by the indicated oncogenes were acutely 
transduced with lentiviral vectors lacking or expressing Gsk3a shRNA and 
then plated in methylcellulose medium. Colonies were enumerated after 

5 days, and the mean (+ s.e.m.) numbers of three independent 
determinations are expressed relative to vector alone. c, Proliferation of 
myeloid progenitors (WT, Gsk3b ’~ or Gsk3b ’~ Gsk3a™”) transformed by 
MLL-ENL (left panel) or NUP98—HOXA9 (right panel) was assessed at the 
indicated days in liquid culture (+ s.e.m. of triplicate analyses). d, The 
morphological features of MLL—ENL transformed myeloid progenitors with 
the indicated genotypes were assessed after 4 days of culture. The bar graph 
indicates the mean number of cells with the indicated morphological 
features (n = 3). e, Survival curves are shown for cohorts of mice 
transplanted with cells (WT, Gsk3b ’~ or Gsk3b ’~ Gsk3a*”) stably 
transduced with MLL—ENL (10 mice each). f, Cell numbers were determined 
after 3 days of culture in the indicated concentrations of lithium chloride. 
g, Survival curves show significantly different latencies (P < 0.001) for the 
development of acute leukaemia in cohorts of mice transplanted with 
MLL-AF4 leukaemia cells (5 X 10*) and maintained on normal or lithium 
carbonate (0.4%) laced chow as indicated. 
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Mice transplanted with MLL—AF4-transformed B cell precursors 
developed a fatal aggressive leukaemia within 29 days, characterized 
by massive infiltration of the bone marrow, spleen and liver, with 
leukaemic blasts (data not shown). However, treatment with lithium 
carbonate, which has been extensively used to modulate in vivo GSK3 
kinase activity” and impairs MLL leukaemia cell proliferation in vitro 
(Fig. 4f), resulted in a significant prolongation of survival (Fig. 4g and 
Supplementary Fig. 7e). These results indicate that sensitivity of 
MLL-transformed cells to GSK3 inhibition is not restricted to in vitro 
environments, and provide evidence of therapeutic efficacy. 


p27“! mediates the response to GSK3 inhibition 


Cell cycle arrest in response to GSK3 inhibition suggested that cell 
cycle regulators may be downstream targets of GSK3 signalling in 
MLL-transformed cells. Western blot analysis implicated the CDK 
inhibitor (CDKI) p27“ ' as the levels of this significantly increased in 
human MLL leukaemia cells (Fig. 5a) and murine transformed 
progenitors (Fig. 5b) after inhibitor treatment, which is temporally 
consistent with the onset of cell cycle arrest (Fig. lc and 
Supplementary Fig. 3a). Conversely, p27“! levels did not increase 
in control cells (Fig. 5a, b), which continued to actively cycle in the 
presence of inhibitor (Supplementary Fig. 3 and data not shown). 
GSK3, either directly or indirectly, negatively regulates p27“?! 
protein stability because inhibitor treatment increased the p27’?! 
half-life without inducing increased messenger RNA levels 
(Supplementary Fig. 8). B-Catenin levels increased substantially in 
MLL-transformed but also in control cells in response to GSK3 
inhibition, whereas levels of p21, another CDKI, were not altered 
(Fig. 5a, b). Knockdown of p27‘?! resulted in substantial reductions 
of p27‘! protein (Fig. 5c), and prevented the growth arrest otherwise 
induced by GSK3 inhibitor in MLL-transformed myeloid progenitors 
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Figure 5 | GSK3 maintains MLL transformation through suppression of 
p27*P1, a, Human leukaemia cell lines (RCH-ACV and KP-L-RY) were 
treated with 10 |1M SB216763 for the indicated times and protein levels were 
assessed by western blot analysis. b, Murine myeloid progenitors 
transformed by the indicated oncogenes were cultured in the presence (+) 
or absence (—) of SB216763 (10 1M) for 24h in liquid culture, and then 
subjected to western blot analysis. ¢, Western blot analysis demonstrates the 
p27‘?! (upper panel) or -tubulin (lower panel, loading control) protein 
levels in MLL-ENL transformed myeloid progenitors stably transduced with 
lentiviral vectors expressing shRNAs specific for p27“?! (denoted as sh1 to 
sh3). d, Myeloid progenitors transformed by MLL-ENL and stably 
transduced with lentiviral vectors lacking (—) or expressing one of three 
different p27“?! shRNAs were cultured for 3 days in the presence or absence 
of 10 1M SB216763. Viable cell numbers are expressed relative to the 
numbers obtained with lentiviral vector transduced cells (+ s.e.m. of 
triplicate analyses). 
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(Fig. 5d). Thus, p27“! is a critical downstream mediator of the cell 
cycle arrest associated with GSK3 inhibition in MLL transformed cells. 


Discussion 


GSK3 maintenance of a genetically distinctive subset of acute leuk- 
aemia establishes an enabling role for this multifunctional kinase in 
oncogenesis. This contrasts with its well-characterized function to 
suppress signalling pathways that otherwise promote proliferation 
and self-renewal, a role thwarted in colon cancer and other cancers 
with oncogenic mutations of B-catenin that abrogate its GSK3- 
mediated phosphorylation and subsequent destruction on the Wnt 
pathway”. Similarly, hyperactivation of AKT is implicated in cancer 
pathogenesis through enhanced survival and proliferation”. In con- 
trast, MLL-transformed cells are sustained by GSK3 and consistently 
antagonized by GSK3 inhibitors of varying selectivity and specificity 
(Supplementary Fig. 1), and also adversely affected by constitutively 
active AKT, a physiological inhibitor of GSK3 activity. Genetic and 
pharmacological studies confirm the requirement of GSK3 to main- 
tain MLL-mediated transformation and leukaemogenesis in precli- 
nical murine models. Thus, GSK3 can promote oncogenesis and does 
not have an exclusively suppressive role in cancer pathogenesis. 

The mechanism by which GSK3 supports MLL-oncogene-induced 
proliferation and transformation is mediated through the destabili- 
zation of p27“'?!, a CDKI with established roles in tumour suppres- 
sion®®. The p27?! and p18'"** genes have been shown to be direct 
transcriptional targets of MLL”, a histone methyltransferase that 
positively maintains gene expression through covalent chromatin 
modification”. In endocrine neoplasias, this tumour suppressor 
circuit that normally maintains CDKI expression is abrogated by 
mutations or loss of menin (also known as multiple endocrine neo- 
plasia)”’, a critical component of the MLL histone methyltransferase 
complex”*”®, MLL oncoproteins also activate p27‘?! expression”, 
which would seem to be counterproductive for leukaemia pathogen- 
esis. Our results indicate a potential mechanism for the suppression 
of p27‘! either directly or indirectly by GSK3, which provides a 
permissive cellular context for MLL-mediated transformation. 
However, phosphorylation of p27“?! by GSK3 has recently been 
shown to enhance its stability and prohibit cell cycle progression in 
the absence of growth factors*', which contrasts with increased 
p27*' levels after GSK3 inhibition in MLL-transformed cells. 
Paradoxically increased p27“?! expression has also been observed 
in a myeloma cell line on GSK3 inhibition as part of a paracrine/ 
autocrine feedback loop involving IL-6 signalling and forkhead tran- 
scription factors’. Therefore, the functional relationships of GSK3 
with p27“?! seem complex and cell context dependent. Nevertheless, 
our studies link these factors on a pathogenic pathway that is critical 
for maintenance of MLL leukaemia. 

The observed dependence on GSK3 provides a potential thera- 
peutic target in a genetically distinctive subset of leukaemia defined 
by mutations of the MLL proto-oncogene. MLL is activated by a 
substantial array of chromosomal aberrations in diverse haematolo- 
gical disorders that account for approximately 5%—10% of sporadic 
leukaemias in adults and children*’. Independent of their association 
with other high-risk features, MLL aberrations are often predictive of 
poor clinical outcome™, which warrants a search for new treatment 
approaches. GSK3 has not previously been considered as a thera- 
peutic target in cancer. In fact, its normally suppressive roles in 
Wnt, hedgehog and Notch pathway signalling have raised the theor- 
etical possibility that GSK3 inhibition may increase the risk of neo- 
plasia. However, chronic administration of lithium, a relatively 
nonspecific GSK3 inhibitor used for the treatment of bipolar disor- 
ders, has not been associated with increased cancer risk*’. Notably, 
GSK3 is a specific in vivo modulator of haematopoietic stem cell 
activity, and GSK3 inhibitors enhance haematopoietic stem cell 
repopulation in mice after bone marrow transplantation'’. Thus, 
like PTEN*°, GSK3 has converse roles in normal versus leukaemia 
stem cell maintenance, which may confer significant therapeutic 
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selectivity. Our preclinical studies using lithium carbonate to target 
GSK3 in a murine model of MLL leukaemia provide promising evid- 
ence of efficacy. The paradoxical sensitivity of MLL leukaemias to 
GSK3 inhibition earmarks this multifunctional kinase as a thera- 
peutic target, and provides a rationale to develop inhibitors with 
suitable pharmacodynamic properties for clinical trials to determine 
whether GSK3 inhibition may have therapeutic efficacy in a distinct- 
ive subset of poor prognosis leukaemia. 


METHODS SUMMARY 

Cell cultures and inhibitor assays. Human leukaemia cell lines, and trans- 
formed mouse myeloid or B cell precursors, were cultured in medium (with 
appropriate supplements) containing kinase inhibitors at the indicated concen- 
trations. Cell viabilities were determined by trypan-blue dye exclusion and cell 
growth was quantified using MTT (3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl- 
tetrazolium bromide) assays. Cell proliferation was determined by measuring 
BrdU incorporation. Flow cytometry was used to assess cell cycle status on the 
basis of propidium iodide staining and to quantify apoptosis on the basis of 
annexin V staining”. 

Transformation and leukaemogenesis assays. Myeloid progenitors were trans- 
duced with retroviral vectors as described previously” with minor modifica- 
tions, and were cultured in liquid or semi-solid medium supplemented with 
cytokines. B cell progenitors were transduced as described previously’’ and co- 
cultured on neo-resistant-irradiated OP9 stromal cells. After continuous passage 
and adaptation to liquid culture, immortalized cells were used for injections of 
syngeneic mice, and cell lines generated by explantation of splenocytes collected 
from leukaemic mice were used for GSK3 inhibitor studies. For knockdown 
studies, transformed progenitors were transduced with shRNA lentiviral con- 
structs, selected for drug resistance in vitro and then evaluated for growth in the 
presence of GSK3 inhibitors. For in vivo studies, myeloid progenitors (wild type 
or Gsk3b/~) transformed by MLL-ENL were transduced with lentiviral knock- 
down constructs, selected for drug resistance and then transplanted (10° cells) by 
intravenous injection into sub-lethally irradiated severe combined immuno- 
deficient (SCID) mice. For lithium treatment, irradiated mice were transplanted 
with MLL—AF4 leukaemic B cell progenitors (50,000 cells) and maintained on 
0.4% lithium-carbonate-containing chow with saline water. 

Protein assays. Protein extracts were prepared by cell lysis in buffer containing 
protease inhibitors, subjected to SDS-PAGE and analysed by western blot using 
primary antibodies as indicated throughout. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. C57BL/6 and SCID mice were obtained from the breeding facility of the 
Stanford University Veterinary Service Center. Gsk3b*/~ mice (provided by G. 
R. Crabtree with permission from J. R. Woodgett) were maintained on a CD1 
genetic background. All experiments on mice were performed with the approval 
and in accordance with Stanford’s Administrative Panel on Laboratory Animal 
Care. 

Inhibitors. The GSK3 inhibitors SB216763 (Sigma), GSK3-IX and alsterpaullone 
(EMDbiosciences) were dissolved in dimethylsulphoxide and used at the indi- 
cated concentrations. All other inhibitors (EMDbiosciences) were dissolved in 
dimethylsulphoxide and used at the concentrations indicated in Supplementary 
Table 1. 

Cell cultures. All human leukaemia cell lines (Supplementary Table 2) were 
maintained in R10 medium (RPMI1640 supplemented with 10% FBS, 1% 
L-glutamine and penicillin/streptomycin). Immortalized mouse myeloid cells 
were maintained in R20/20 medium (RPMI1640 with 20% FCS, 20% WEHI- 
conditioned medium, 1% .-glutamine and penicillin/streptomycin). Immor- 
talized mouse B cells were cultured in OP9 medium (MEMa+ GlutaMax 1 with 
10% FBS, 1% L-glutamine, penicillin/streptomycin and 2 1M fB-mercaptoethanol) 
containing 1 ng ml”! IL-7 when necessary. All culture medium was obtained from 
Gibco. 

DNA constructs and virus production. Retroviral constructs (MSCV vector) 
encoding MLL-—ENL, MLL-LAF4, MLL-AF6, MLL-GAS7, E2A—PBX1, NUP98— 
HOXA9 and E2A-HLF were reported previously*'*’’”°. Retroviral constructs 
encoding CA—AKT, ER-CA-AKT", and wild-type or S9A mutant GSK3B were 
constructed by cloning the respective complementary DNAs into MSCV using 
standard cloning techniques. Retrovirus production was performed as described 
previously’. Oligonucleotides for specific shaRNA knockdown of Gsk3a, Gsk3b 
or p27*'?! (sequences in Supplementary Table 3) were designed using 
PSICOLIGOMAKER 1.5 software (http://web.mit.edu/jacks-lab/protocols_ 
table.html), and cloned into pSicoR-Puro or pSicoR-Hygromycin lentiviral vec- 
tors. Lentiviral stocks were produced as described previously”’. 

Murine progenitor transformation assays. Myeloid progenitor transformation 
was performed as described previously”? with minor modifications. In brief, 
c-Kit* cells were isolated from the bone marrow of 4-8-week-old C57BL/6 mice 
or E16 fetal livers (Gsk3b / ~ mice) using an auto-MACS and anti-c-Kit beads 
(Miltenyi Biotech). The c-Kit™ cells were spinoculated with retroviral supernat- 
ant in the presence of 5g ml _' polybrene for 2h at 1,350g and at 32°C. After 
spinoculation and after overnight culture, cells were plated in methylcellulose 
medium (M3231; Stem Cell Technologies) containing 20 ng ml ' stem cell fac- 
tor, 10ngml_' IL-6, granulocyte-macrophage colony-stimulating factor (GM- 
CSF), and IL-3 (R&D Systems) with appropriate antibiotic selection. After 5-7 
days of culture, colonies were counted, pooled, and then 10* cells were replated 
in the same medium but without antibiotic. At the end of the fourth round, cells 
were transferred to R20/20 medium to establish continuous cell lines. 

B cell progenitors were transduced as described previously’’ with minor mod- 
ifications. Transduced cells were co-cultured on neo-resistant-irradiated OP9 
stromal cells. After continuous passage and adaptation to liquid culture, immor- 
talized cell lines were used for injections of syngeneic recipient mice. MLL-AF4B 
cell precursor leukaemia cell lines were generated by explantation of splenocytes 
collected from leukaemic mice. 

Transduction of immortalized mouse cells. Immortalized mouse cells (20,000) 
were transduced with retroviral or lentiviral constructs by spinoculation at 


nature 


2,500g or 1,200g for 2h at 32°C. Transduced cells were then resuspended in 
200 pl of R20/20 or OP9 medium and transferred to 96-well plates. After over- 
night incubation at 37 °C, myeloid cells were plated in methycellulose medium 
containing IL-3, IL-6, GM-CSF and stem cell factor. Transduced B cells were 
plated in methycellulose medium containing IL-7. 

Leukaemogenesis assays. Myeloid progenitors (wild type or Gsk3b-/~) trans- 
formed by MLL-ENL were transduced with different lentiviruses, selected for 
drug resistance and then transplanted (10° cells) by intravenous injection into 
sub-lethally irradiated (2Gy) C.B-17 scid/scid mice (6—8—-weeks-old). For 
lithium treatment, irradiated (1 Gy) C57BL/6 mice were transplanted with 
MLL-AF4-transformed B cell progenitors (50,000 cells) and maintained on 
0.4% lithium-carbonate-containing chow with saline water (Harlan Teklad). 
Lithium treatment was initiated 3 days before transplantation and continued 
for 30 days, at which point treatment was withheld for 5 days to allow recovery 
from drug-induced diuresis, and was then resumed. Development of acute leuk- 
aemia was confirmed by blood smear, peripheral blood leukocyte counts, FACS 
analyses and/or histology. 

Flow cytometry. Staining of cells for FACS analysis was performed as previously 
described” using conjugated antibodies obtained from either BD Pharmingen or 
eBioscience. Cell cycle assays using propidium iodide staining, and apoptosis 
assays using annexin V staining, were performed as described’. BrdU incorp- 
oration was determined using the BrdU flow kit (BD Pharmingen) according to 
the manufacturer’s instructions. 

Cell proliferation and MTT assays. Cultured cells (10,000—20,000) were plated 
in 96-well plates in R10, R20/20 or OP9 medium (100 pl) containing different 
concentrations of the indicated kinase inhibitors (Supplementary Table 1) and 
incubated at 37 °C. The numbers of viable cells were determined by trypan-blue 
dye exclusion at the indicated times using a haemocytometer. For MTT assays, 
cells were cultured for 3—4 days and then quantified using a cell proliferation kit 1 
under conditions recommended by the manufacturer (Roche). 

Western blot. Cells were lysed in buffer A (20 mM Tris, pH 7.5, 150 mM NaCl, 
1% Nonidet P-40, 1 mM EDTA) containing protease inhibitors (complete mini 
protease inhibitors; Roche). Proteins (40 jig) were subjected to SDS-PAGE and 
analysed by western blot using primary antibodies specific for GSK3 (Upstate 
Biotechnology), f-catenin (Upstate Biotechnology), phosph-GSK3 (Cell 
Signalling), AKT (Cell Signaling), tubulin (Sigma), p27 or p21 (Santa Cruz 
Biotechnology). 
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TMEMI16A confers receptor-activated 
calcium-dependent chloride conductance 


Young Duk Yang’, Hawon Cho’, Jae Yeon Koo', Min Ho Tak!, Yeongyo Cho', Won-Sik Shim’, Seung Pyo Park’, 
Jesun Lee’, Byeongjun Lee', Byung-Moon Kim', Ramin Raouf’, Young Ki Shin’ & Uhtaek Oh’ 


Calcium (Ca?*)-activated chloride channels are fundamental mediators in numerous physiological processes including 
transepithelial secretion, cardiac and neuronal excitation, sensory transduction, smooth muscle contraction and fertilization. 
Despite their physiological importance, their molecular identity has remained largely unknown. Here we show that 
transmembrane protein 16A (TMEM16A, which we also call anoctamin 1 (ANO1)) is a bona fide Ca?‘ -activated chloride channel 
that is activated by intracellular Ca2* and Ca?‘ -mobilizing stimuli. With eight putative transmembrane domains and no apparent 
similarity to previously characterized channels, ANO1 defines a new family of ionic channels. The biophysical properties as well as 
the pharmacological profile of ANO1 are in full agreement with native Ca*-activated chloride currents. ANO1 is expressed in 
various secretory epithelia, the retina and sensory neurons. Furthermore, knockdown of mouse Ano7 markedly reduced native 
Ca’*-activated chloride currents as well as saliva production in mice. We conclude that ANO1 is a candidate Ca** -activated 
chloride channel that mediates receptor-activated chloride currents in diverse physiological processes. 


Chloride (Cl) channels have fundamental roles in many physiological 
functions. Members of one group of these channels are activated by 
intracellular Ca”, and, accordingly, are collectively referred to as Ca” - 
activated chloride channels (CaCCs). CaCCs control the apical outflux 
of Cl, which is essential for the vectorial transport of electrolytes and 
water by means of secretory epithelia in kidneys, airways, intestine, 
pancreas and salivary glands’. In addition, CaCCs are also known to 
contribute to cardiovascular functions by controlling vascular endothe- 
lial as well as smooth muscle tone and by regulating cardiac myocyte 
excitability*’. Moreover, by changing membrane potentials, CaCCs also 
regulate neuronal cell excitability, which is responsible for controlling 
phototransduction, olfaction, gustation and somaesthetic sensations’. 
The CaCC-mediated cellular responses are initiated by the stimu- 
lation of G-protein-coupled receptors (GPCRs). Many GPCR ligands 
such as ATP, acetylcholine, endothelin-1, angiotensin II, lysophospha- 
tidic acid and histamine can evoke Ca” * -dependent chloride currents'*. 
Interestingly, despite their wide expression pattern, endogenous 
CaCCs exhibit very similar characteristic biophysical and pharmaco- 
logical properties: Ca”* -activated Cl" currents are sensitive to Cl" chan- 
nel blockers, show small channel conductances and have a voltage- 
dependent Ca’* sensitivity. Furthermore, their ion permeability 
follows the selectivity sequenceI >Br >Cl >F (refs 1, 6, 7). 
Because of their pathophysiological significance, many efforts have 
been made to clone CaCCs. CLCAI, isolated from bovine trachea, 
bestrophins related to Best disease and human CLC3 have been iden- 
tified as candidate CaCCs*"°. Although these candidate genes exhibit 
some of the properties of the native CaCCs, incontrovertible evidence of 
their involvement in native Ca”* -activated chloride currents is lacking 
or disputed'*"'. Here we characterize a new family of chloride channels, 
exemplified by ANOI, and provide evidence for their direct involve- 
ment in mediating endogenous Ca’* -activated chloride current. 


Characterization of mouse ANO1 


In our attempts to unravel the molecular identity of endogenous 
CaCCs, we searched public domain databases for putative channel- or 


transporter-like genes with more than two transmembrane domains 
and multiple isoforms. Amongst putative transmembrane proteins 
with unknown functions, TMEM16A attracted our attention because 
it has multiple putative membrane-spanning domains and ten homo- 
logues in man (Fig. la). Because the expressed sequence tags of 
TMEM 16A are frequently found in retina, full-length Tmem16acom- 
plimentary DNA was amplified using polymerase chain reaction with 
reverse transcription (RT-PCR) from total RNA isolated from mouse 
eye. Mouse Tmem16a cDNA contains an open reading frame of 2,880 
nucleotides that encodes a protein of 960 amino acids (Supplementary 
Fig. 1). Mouse TMEM16A has high homology (91% amino-acid 
sequence identity) with its human orthologue, and its predicted rat 
orthologue has an additional 123 amino acids at its amino terminus; 
the remainder of the rat sequence has high homology with mouse 
TMEM16A (99% identity). Interestingly, the TMEM16 family does 
not share significant sequence homology with other families of ion 
channels. 

Hydropathy analysis suggests that TMEMI16A has eight trans- 
membrane-spanning domains (Fig. 1b). Consensus analysis pre- 
dicted that ANO1 has multiple protein kinase A, protein kinase C, 
protein kinase G and casein kinase phosphorylation sites at intracel- 
lular protein segments and multiple glycosylation sites in extracellu- 
lar segments. Even though TMEM16A is activated by intracellular 
Ca** (see below), no specific consensus sites for Ca”* binding, such 
as the EF hand of voltage-gated Ca”* channels or the IQ motif of 
Ca”*-calmodulin, were found. Because TMEM16A has eight trans- 
membrane domains and an anionic channel property (see below), we 
tentatively called it anoctamin 1 (ANO1). 


ANO1 mediates a GPCR-activated chloride current 

Under physiological conditions CaCCs are activated by GPCR stimu- 
lation. Endothelin-1 is a powerful vasoconstrictor that depolarizes 
vascular smooth muscle by activating CaCCs*. We expressed mouse 
ANO1 and endothelin receptor subtype A (ET,R, also known as 
EDNRA) in HEK 293T cells (ANO1/ET,R-HEK) along with green 


"Sensory Research Center, CRI, College of Pharmacy, Seoul National University, Seoul 151-742, Korea. *Molecular Nociception Group, Neuroscience, Physiology and Pharmacology, 


University College London, London WCIE 6BT, UK. 
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fluorescent protein (GFP) for visual identification of transfected 
cells, and tested the current response to endothelin-1 stimulation 
in whole-cell configuration. Pipette and bath solutions contained 
140 mM N-methyl p-glucamine (NMDG)-Cl, which permitted Cl™ 
to be used as the main charge carrier. The pipette solution contained 
no added Ca** or EGTA. Under these conditions, application of 
50 nM endothelin-1 to ANO1/ET,R-HEK cells evoked robust inward 
currents when the membrane potential was held at —60 mV (Fig. Ic). 
In contrast, endothelin-1-evoked currents were minimal in HEK 
293T cells transfected with ET,R only or control (in which only 
GFP was transfected) cells. Similarly, the co-expression in HEK 
293T cells of ANO1 with rat angiotensin II receptor subtype 1 
(ATIR, also known as AGTRIA), muscarinic receptor subtype 1 
(M1R, also known as CHRM1), histamine receptor subtype 1 
(H1R, also known as HRH1) or purinergic receptor subtype 2 
(P2Y2R) (all of which are known to use native CaCCs for their 
cellular functions)'?""* resulted in robust current responses to angio- 
tensin II, carbachol, histamine and ATP, respectively (Fig. 1c). No 
significant currents were observed in control transfectants. 
Moreover, in ANO1-transfected cells, the application of endothe- 
lin-1, ATP or carbachol evoked small but significantly greater cur- 
rents than in control cells, presumably owing to the presence of 
endogenous receptors for these ligands in control HEK 293T cells 
(Fig. 1c). These results demonstrate that ANO1 can mediate 
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Figure 1| ANO1-transfected cells respond with robust inward currents to 
GPCR stimulation. a, Phylogenetic tree depicting the anoctamin family in 
human. Scale bar, 0.1 nucleotide substitutions per site. b, Predicted topology 
of mouse ANO1. ¢, Top panel: large inward currents were activated by 
endothelin-1 (50 nM), angiotensin II (1 1M, Angio II), carbachol (1 1M), 
histamine (100 1M) or ATP (100 uM) when mouse ANO1 was co- 
transfected into HEK 293T cells with ET,R, ATIR, MIR, HIR or P2Y2R. 
Enola = —60 mV. Dashed lines represent zero current. Lower panel: 
summary of receptor-stimulation-activated currents (experiment 
numbers = 5-18). **P < 0.001 compared to control (analysis of variance 
(ANOVA), Duncan’s post-hoc test). Error bars, mean + s.e.m. 
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receptor-activated chloride currents in HEK 293T cells in a similar 
manner to endogenously activated CaCCs. 


ANOT recapitulates the properties of endogenous CaCCs 

We next examined the current-voltage (I/V) relationship of endothe- 
lin-1-induced ANO1 currents (Fig. 2a). Voltage ramps from —80 to 
+80 mV were delivered to ANO1/ET,R-HEK cells. We found that 
during the early phase of the response, where the developing current 
was small, an outward-rectifying I/V relationship was observed. In 
contrast, a linear I/V relationship was observed during the peak cur- 
rent response. This is consistent with the linear or outwardly rectify- 
ing I/V relationships of endogenous CaCCs when intracellular Ca** 
concentrations are high or low®*'®!®. To determine the ionic select- 
ivity of ANO1, voltage ramps from — 80 to +80 mV were delivered to 
ANO1/ET,R-HEK cells. Endothelin-1 (50 nM) was applied to whole 
cells to evoke ANO1 currents; the pipette solution contained 140 mM 
NMDG-Cl (0mM Ca**, 0mM EGTA). When the bath solution 
contained equimolar NMDG-Cl, a linear J/V relationship that 
reversed near OmV (—4.5mV) was observed (Fig. 2b). 
Furthermore, when the bath solution was changed to 140mM 
sodium gluconate, no outward current was observed. Because 
NMDG*" and gluconate are incapable of passing through channels, 
these results demonstrate that ANO1 is an anion channel. 
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Figure 2| ANO1is a Cl” channel. a, Typical current—voltage relationships 
of endothelin-1-evoked currents at different time points in ANO1/ET,R- 
HEK cells. Voltage ramps: —80 to +80 mV, 320 ms duration, 0.83 Hz. 

b, Anion selectivity of endothelin-1-evoked currents in ANO1/ET,R-HEK 
cells. Pipette solution contained 140 mM NMDG-CL. Bath solutions 
contained 140 mM NMDG-C] or Na-gluconate. ¢, Anion permeability. Bath 
solution contained 140 mM NaCl, NaBr, NaI, NaNO; or NaF. d, Inhibition 
of endothelin-1-evoked currents in ANO1/ET,R-HEK cells by Cl channel 
blockers as well as classical inhibitors (n = 5 for each) of native CaCCs. 

10 uM of chemicals (except mefloquine, 5 1M) were applied 5 min before the 
second application of endothelin-1. Control (m = 12). NPA, n-phenyl- 
anthranilic acid. **P < 0.001 compared to control (ANOVA, Duncan’s post- 
hoc test). Error bars, mean + s.e.m. 
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The permeability sequence of monovalent anions is characteristic 
of CaCCs**"’, and, thus, we investigated the relative permeability of 
ANO1 to monovalent anions. For this experiment the pipette solu- 
tion contained 140 mM NMDG-Cl (0mM Ca**, 0mM EGTA). 
Permeability ratios (Px/Pc,) were estimated using the Goldman, 
Hodgkin and Katz equation from reversal potential changes induced 
by replacing extracellular Cl” with NO; , Br , I” or F '*. When 
NaCl in the bath was substituted with equimolar NaF, NaBr, Nal or 
NaNO; , reversal potentials shifted from —9.0+0.6mV (mean + 
s.e.m.; n= 5) to +12.542.9mV (n=8), —23.142.2mV (n=5), 
—24.7+2.5mV (n=5) and —29.1+1.1mV (n=7), respectively 
(Fig. 2c). Thus, the relative permeabilities of these monovalent 
anions were NO; (2.20)>I (1.85)>Br = (1.74) > Cl 
(1.0) >F (0.43), which is entirely consistent with those of endo- 
genous CaCCs (Supplementary Table 1)**"7. 

Agonist-induced currents in ANO1-expressing cells were blocked by 
Cl channel blockers, 4,4'-diisothiocyanatostilbene-2,2’-disulphonic 
acid (DIDS), niflumic acid or 5-nitro-2-(3-phenylpropylamino)- 
benzoic acid (NPPB)*?"’. In ANOJ/ET,R-HEK cells, 50nM 
endothelin-1 was applied twice with a 5 min interval, and second res- 
ponses were found to reach 87.1 + 11.4% (n= 12) of first responses, 
eliciting weak tachyphylaxis. Pretreatment or co-application of DIDS, 
niflumic acid and NPPB markedly inhibited this response of ANO1/ 
ET,R-HEK cells to the second application of endothelin-1 (Fig. 2d). In 
addition, modifiers of various enzymes or transporters such as fluox- 
etine (a serotonin reuptake inhibitor), tamoxifen (an oestrogen recep- 
tor modulator), n-phenyl-anthranilic acid (a cyclooxygenase 
inhibitor) and mefloquine (an antimalarial drug) are known to block 
native CaCCs**”. If ANO1 is a candidate CaCC, mouse ANO1 should 
be inhibited by these CaCC blockers. Indeed, these chemicals markedly 
inhibited endothelin-1-induced currents in ANO1/ET,R-HEK cells 
(Fig. 2d). This pharmacological profile further confirms that ANO1 
is a CaCC candidate. 


Mutation in the putative pore region of ANO1 changes ion 
permeability 


We looked to determine whether ANO1 subunits directly assembled 
into an ion channel that has a pore and a selectivity filter. To deter- 
mine whether ANO1 contains a pore region, we selected a highly 
conserved region by analogy with other homologues that lies between 
TM5 and TM6 consisting of 103 amino acids for mutation analysis 
(Supplementary Fig. 3). This region contains three positively charged 
residues (R621, K645 and K668) that are highly conserved among 
isoforms. We reasoned that mutating these positively charged resi- 
dues would affect the ionic properties of the channel if the residues 
are part of the pore region. When the positively charged residues were 
replaced with a negatively charged residue (glutamate), a marked 
reduction in anion permeabilities and a notable increase in cation 
permeability were observed (Supplementary Information). This 
mutational study suggests that the region spanning TM5 and TM6 
is a pore-forming region. 


ANO1 is activated by intracellular Ca2* 


To prove that ANO1 is activated by Ca", Ca** was applied to inside- 
out membrane patches isolated from ANO1/ET,R-HEK cells. Ca*~, 
applied to the bath (intracellular side) in various concentrations, 
evoked macroscopic single-channel currents in a dose-dependent 
manner (Ehoiq = —60 mV; Fig. 3a). A half-maximal concentration 
(ECs) for ANO1 activation was 2.6 uM at —60 mV. The threshold 
concentration of Ca** reached about 0.3 1M at —60 mV holding po- 
tential (Fig. 3b). Remarkably, ANO1 activation by Ca** was voltage- 
dependent, which is also characteristic of endogenous CaCCs”"*!®, At 
a depolarized membrane potential (+60 mV), ANO1 was more sen- 
sitive to Ca’* and the concentration-response curve shifted to the 
left, resulting in an ECs of 0.4 1M (Fig. 3b). Hill’s coefficients of the 
concentration—response curves were 2.0 and 2.4 at —60mV and 
+60 mV, respectively. Surprisingly, ANO1 current was activated less 
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by [Ca?*] > 10 1M, which is another indication of CaCC candidacy 
because native CaCC current is also inhibited by high Ca** (ref. 19). 
When 30 UM Ca’* was applied to inside-out patches from ANO1/ 
ET,R-HEK cells (Eyoig= —60mV) it reached to 53.34 17.1% 
(n= 5) of the current amplitude induced by 10 1M Ca’*. 

The amplitudes of single-channel ANO1 currents were small, like 
those of endogenous CaCCs*”’. At —60 mV, the average single-chan- 
nel current amplitude was 0.42 + 0.01 pA (n= 6; Fig. 3c). Moreover, 
we measured the single-channel current amplitudes of ANO1 acti- 
vated by Ca’* at various membrane potentials, and found a linear I/V 
relationship. In addition, the slope conductance of ANO1 was 8.3 pS, 
which is comparable to those of native CaCCs in various cell types*”°. 

Various GPCRs are known to activate CaCCs by means of the 
action of Go subunits of the G, class of G protein, which stimulates 
phospholipase C (PLC) to yield inositol-1,4,5-triphosphate 
(Ins(1,4,5)P3) and subsequent Ca?* release from internal stores!”. 
Thus, it is likely that ANO1 activation by GPCRs is mediated by 
the PLC/Ins(1,4,5)P3 signalling pathway. Indeed, we observed that 
ANO1 is activated by intracellular signals that increase intracellular 
Ca** concentrations, most notably by means of the PLC/Ins(1,4,5)P3 
pathway (Supplementary Information). ANO1 was also activated by 
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Figure 3 | ANO1 is activated by intracellular Ca?* in a voltage-dependent 
manner. a, Macroscopic current responses to repeated applications of Ca”* 
in various concentrations to the bath of inside-out patches isolated from 
ANO1/ET,R-HEK cells at holding potentials of —60 or + 60 mV. Pipette and 
bath solutions contained 140 mM NMDG-CL b, Dose-response relationship 
of ANO1 activation by Ca?*. Current responses were normalized versus 
those observed at 10 1M (circles, —60 mV, n = 8) or 3 UM Ca?* (triangles, 
+60 mV, n = 8). Data points were fitted to the Hill’s equation. Error bars, 
mean ~ s.e.m. ¢, Single-channel currents activated by Ca”* (0.5 1M) in an 
inside-out membrane patch. Signals were filtered at 2 kHz. d, An amplitude 
histogram of single-channel currents activated by Ca?* at +60 mV. 

e, Current—voltage relationship of single-channel Cl currents activated by 
Ca?* (n = 3-7). Error bars (mean + s.e.m.) are too small to be buried in 
symbols. 
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dialysing Ca** to whole cells as observed with endogenous CaCCs*”! 


(Supplementary Information). 


ANO1 expression pattern 


In immunohistochemical analysis for visualization of tissue distri- 
bution, ANO1 was highly expressed in tissues in which endogenous 
CaCCs have been described. As shown in Fig. 4a, dense ANO1 
immunoreactivity was observed in epithelial cells of pulmo- 
nary bronchioles, whereas expression in lung alveolar cells was less 


Figure 4 | ANO1 is expressed in transport epithelia and other tissues. 
Residues 451 to 466 of mouse ANO1 were chosen for generation of an ANO1 
polyclonal antibody for immunohistochemical analysis. a, Lung, dense 
ANO!1 immunoreactivity in epithelial cells in bronchioles (black arrow); 

b, pancreas acinar cells (black arrow); ¢, kidney, epithelia of proximal tubules 
(black arrow) and distal renal tubules (yellow arrow); d, retina, dense 
immunoreactivity in the outer nuclear (white arrow) and inner nuclear and 
ganglion cell layers (red arrow); e, dorsal-root ganglion, small (red arrow) 
and large (yellow arrow) sensory neurons; f, submandibular gland, ANO1 
immunoreactivity in acinar cells (black arrow). g, h, To verify the expression 
of ANO1 in the apical (luminal) membrane of acinar cells, submandibular 
gland was double-stained with antibodies of mouse ANO1 and aquaporin 5 
(AQP5, g) or NKCCI1 (h), marker antibodies for the apical membrane or the 
basolateral membrane of acinar cells in submandibular gland. 
Magnification, < 400. 
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prominent. Pancreatic acinar cells were also heavily stained by ANO1 
antibody (Fig. 4b). In the kidney, dense immunoreactivity was 
observed mainly in the epithelia of proximal (black arrow) and 
weakly in distal (yellow arrow) renal tubules (Fig. 4c). Remarkably, 
as has been proven by electrophysiological experiments”, dense 
ANO1 immunoreactivity was found in all retinal cell layers 
(Fig. 4d), that is, the outer nuclear layer, which contains the bodies 
of photoreceptors (white arrow), and the inner nuclear and ganglion 
cell layers (red arrow). Most sensory neurons but not satellite cells in 
dorsal root ganglia stained positively for ANO1 (Fig. 4e), in agree- 
ment with the modulatory effects of CaCCs in sensory transmis- 
sion”’. Small sensory neurons (red arrow) tended to stain more 
densely than large sensory neurons (yellow arrow), suggesting that 
ANOL has a modulatory role in nociception, because small neurons 
in dorsal-root ganglia are intimately involved in nociception”. In 
submandibular glands, dense immunoreactivity was observed at 
the apices of acinar cells colocalized with aquaporin 5, a marker for 
the apical membrane of acinar cells**° (Fig. 4g), but much less with 
NKCCI1 (also known as SLC12A2), a marker for the basolateral mem- 
brane of acinar cells*° (Fig. 4h). In contrast, we did not find ANO1 
immunoreactivity in the brain except for a weak immunoreactivity in 
neural tracts (data not shown). Furthermore, dense immunoreacti- 
vity was observed in Leydig cells and moderate immunoreactivity was 
observed in growing spermatocytes in testes (data not shown). Thus, 
the expression pattern of ANO1 in these secretory epithelial cells, 
retinal and sensory neurons coincides with the reported locations 
of CaCCs’”. 


ANO1 mediates the physiological responses of CaCCs 


Our biophysical and pharmacological evidence suggests that ANO1 
confers Ca’*-activated Cl” conductance. To explore the physio- 
logical functions of ANO1, we investigated its relationship with the 
various functions of endogenous CaCCs. In particular, we investi- 
gated the role of ANO1 in salivary secretion because much evidence 
indicates that CaCCs are present in the acinar cells of salivary 
glands*!. To address this issue, Anol small interfering RNA 
(siRNA) was injected into mice intravenously (retro-orbital injec- 
tion) to knockdown Ano1; scrambled siRNA was also administered as 
a control. To facilitate siRNA tissue uptake, it was mixed with a 
liposome-like material, polyethylenimine, before injection (see 
Methods). Four days after this siRNA administration, pilocarpine- 
induced saliva was collected to measure the rate of saliva production 
over 20 min, because the downregulation of Anol was expected to 
reduce saliva production. Pilocarpine, a muscarinic receptor agonist, 
administration (1 mg kg ', intraperitoneally (i.p.)) greatly increased 
salivary production in control mice, which peaked at 15 min after 
injection. The salivary production was not affected by scrambled 
siRNA, whereas Ano] siRNA significantly reduced peak salivary flow 
induced by pilocarpine (Fig. 5b). Consistent with reduction in sali- 
vation, acetylcholine-induced currents in primary cultures of sub- 
mandibular glands isolated from siRNA-injected mice were 
significantly smaller than those of control mice and scrambled 
siRNA-treated mice (P< 0.05, ANOVA, Duncan’s post-hoc test; 
Fig. 5c). As expected, ANO1 expression in submandibular glands 
was greatly reduced in siRNA-treated mice compared to those of 
control or scrambled siRNA-injected animals (Fig. 5a). The osmola- 
rities of saliva and concentrations of Na‘, K* and Cl in saliva 
samples of siRNA, scrambled siRNA-treated and control mice were 
similar (Supplementary Table 2). These results suggest that ANO1 is 
required for agonist-induced salivary secretion. 


Discussion 


Several candidate CaCC genes have been cloned. CLCA1 was initially 
isolated from bovine trachea’’. Some CLCA phenotypes are similar to 
those of native CaCCs!*’’. However, because of the different bio- 
physical properties of bovine CLCA2, CLCA does not well match 
the requirements of a CaCC’*!’. Moreover, bovine CLCA2 is now 
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Figure 5| Mouse Ano7 siRNA reduces pilocarpine-induced salivary 
output. a, Expression of ANO1 (arrow) in the submandibular glands of 
control, Anol siRNA-treated and scrambled siRNA-treated animals. 

b, Effects of Anol siRNA treatment on saliva production. Saliva output was 
measured in every 5 min in control (n = 10), siRNA-treated (n = 8) and 
scrambled siRNA-treated (1 = 10) mice. Pilocarpine (1 mg kg ') was 
injected intraperitoneally (arrow) to induce sufficient saliva production. 
**P < 0.01 compared to control (ANOVA, Duncan’s post-hoc test). Error 
bars, mean = s.e.m. ¢, Upper panel: representative traces of acetylcholine 
(Ach, 0.5 uM)-induced currents in cells of submandibular glands isolated 
from mice injected with Ano1 siRNA or scrambled RNA four days before 
culture. Lower panel: summary of current responses of acinar cells in 
submandibular glands. *P < 0.05 compared to control (ANOVA, Duncan’s 
post-hoc test). Error bars, mean + s.e.m. 


considered to be an endothelial adhesion molecule that mediates 
tumour cell adhesion**. Human CLCA3 seems to be a secreted pro- 
tein and to have no transmembrane domain after truncation”. 
Notably, mouse CLCA1 produces Ca’* -activated Cl” currents that 
resemble those of endogenous CaCCs when co-expressed with 
KCNMBI (a regulatory subunit of Ca**-activated K* channel)”. 
The bestrophins are proteins related to Best disease, a vitelliform 
macular dystrophy*. These proteins have biophysical properties that 
are similar to those of endogenous CaCCs, that is, direct activation by 
intracellular Ca** at physiological concentrations and the anion per- 
meability sequence®. In addition, mutation in amino acids alters 
anion permeabilities*!. Knockdown of bestrophin 1 reduces Ca**- 
activated Cl” conductances”. CLC3 (a homologue of the voltage- 
gated chloride channel CIC-0) has also been suggested to be a CaCC’. 
However, the general biophysical properties of CLC3 differ substan- 
tially from those of native CaCCs. Clc3'~ mice show Ca’ * -activated 
Cl” conductance**. These channels may therefore contribute to 
aspects of calcium-activated chloride flux, but other channels are also 
likely to be involved in this phenomenon. 

Cystic fibrosis is a life-threatening disease, in which Cl” secretion 
by airway and other secretory epithelia is reduced**. Cystic fibrosis 
transmembrane regulator (CFTR), a Cl” channel activated by intra- 
cellular cAMP, is responsible for the dysfunction in Cl” secretion in 
cystic fibrosis patients***’. However, compelling evidence suggests 
that CaCCs are also involved in Cl secretion in these tissues. In 
mice, CaCCs appear to compensate for CFTR loss of function; this 
is shown by no reduction in fluid secretion or apparent cystic fibrosis 
symptoms in CFTR-deficient mice**. Even though such functional 
compensation is not evident in human cystic fibrosis, evidence indi- 
cates that the activation of latent CaCCs could compensate for CFTR 
loss of function in cystic fibrosis patients*’”**. The treatment of 
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human airway epithelial cells with UTP, a purinergic receptor ago- 
nist, was found to cause Cl” conductance in cystic fibrosis epithelia”. 
These results suggest that CaCC activation could compensate for 
CFTR loss of function in cystic fibrosis, and that ANO1 may present 
an alternative target in cystic fibrosis. 

Recently, evidence has been presented indicating that ANO1 pro- 
teins participate in tumorigenesis. ANO1 is located in the 11q13 
human chromosome, where many tumour-related genes such as 
cyclin D1 are clustered*®. ANO1 is markedly upregulated in squam- 
ous cell carcinoma of the head and neck*' and in gastrointestinal 
stromal tumours*. The reason for this propensity of ANO1 in 
tumours is unknown, but it is conceivable that a secretory envir- 
onment might be necessary for tumour cell proliferation. The abund- 
ance of ANO1 in tumours may provide clues relating to the treatment 
or prediction of cancer in these tissues. 

ANO1 is thus the prototypical member of a structurally unique 
new family of ion channels. Its properties and distribution are con- 
sistent with its identity as the long sought CaCC. Because such chan- 
nels are implicated in a wide range of physiological functions, gene 
deletion studies should prove informative. In addition, these chan- 
nels may represent a new class of drug targets for a variety of diseases, 
ranging from cystic fibrosis to cancer. Further characterization of 
other members of the family should help define the significance of 
these new ion channels in health and disease. 


METHODS SUMMARY 

Cloning of mouse ANO1. First-strand cDNAs were reverse transcribed from 
total RNA isolated from mouse eye. The full-length coding sequence of Anol 
(Tmem16a,; NM_178642.4 gi: 110835695) was amplified by PCR, and subcloned 
into pSDTF for oocyte expression or into pEGFP-N1 for the expression in 
mammalian cells. 

Site-directed mutagenesis. All mutants of ANO1 were generated by site-direc- 
ted mutagenesis, substituting the central 1-2 nucleotides of the desired muta- 
genic site with two complimentary mutagenic primers using the Muta-Direct 
site-directed mutagenesis kit (iNtRON Biotech). 

siRNA. The Anol siRNAs and scrambled siRNA were synthesized and injected to 
the retroorbital sinuses in the orbits of 7-week-old male ICR mice with poly- 
ethylenimine for efficient target delivery to tissues in vivo. 

Salivary secretion test. Four days after siRNA injections, saliva was collected 
5 min before and 5, 10, 15 and 20 min after administration of pilocarpine-HCl 
(I1mgkg ', i.p.). A piece of pre-weighed cotton ball (~10 mg) was inserted 
below each tongue to measure the saliva volumes. The saliva-soaked cotton balls 
were removed, weighed and then spun to extract saliva for measurements of 
osmolarity and electrolyte content. 

Current recordings. Whole cells were formed with a ~3 MQ Sylgard-coated 
glass pipette. The junction potential was adjusted to 0 mV. Whole-cell currents 
were amplified using an Axopatch 200B amplifier (Molecular Device) and stored 
ina personal computer after digitization by a Digidata 1440 (Molecular Device). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cloning of mouse ANO1. The full-length coding sequence of mouse Anol was 
amplified from first-strand cDNAs by PCR using site-specific primers: forward 
primer 5’-CCGCTCGAGGCCACCATGAGGGTCCCCGAGAAG-3’, _ reverse 
primer 5'-GGGGTACCCTACAGCGCGTCCCCATGGTAC-3’. | pEGFP-N1 
contained the GFP gene in the vector. To express ANO1-GFP fusion protein, 
a stop codon was deleted from pEGFP-N1-mANO1 using the Muta-Direct site- 
directed mutagenesis kit (INtRON Biotech) because pEGFP-N1-mANO1 has a 
stop codon between the Anol and GFP coding sequences. 

Cloning of GPCRs. Full-length cDNAs of human ET,R (EDNRA), human 
P2Y2R (P2Y2R) and rat H1R (Hrh1) were obtained by PCR from single-strand 
cDNAs of HEK 293T, HaCaT and rat dorsal-root ganglion cells, respectively, and 
inserted into pCDNA3.1 (Invitrogen), pEGFP-N1 (Clontech) and pXOON (a 
gift from T. Jespersen). Rat ATIR (Agtrla) and human M1R (CHRM1) were 
donated by Y. S. Bae and C. H. Lee, respectively. 

siRNA. We designed three siRNAs that target different regions of the Anol 
transcript and were positioned at nucleotides 1602-1620, 1935-1953 and 
2288-2306. The targeted three siRNA sequences were: Anol_siRNA1 (coding 
sequence position: 1602-1620): sense 5’-GAGUCUUAGAGAAGUCACU 
(dTdT)-3', antisense 5’-AGUGACUUCUCUAAGACUC(dTdT)-3’; Anol_siRNA2 
(coding sequence position: 1935-1953): sense 5’-GAACAAUCUCUUCGA 
GAUU(dTdT)-3’, antisense 5’-AAUCUCGAAGAGAUUGUUC(dTdT)-3’; 
and Anol_siRNA3 (coding sequence position: 2288-2306): sense 5'-GACTT 
CAACCTCGAACCTT(dTdT)-3’, antisense 5’-AAGGUUCGAGGUUGAAG 
UC(dTdT)-3’. Negative control siRNA (scrambled siRNA): sense 5’-CCU 
ACGCCACCAAUUUCGU(dTdT)-3’, antisense 5’-ACGAAAUUGGUGGC 
GUAGG(dTdT)-3’.. The Anol siRNAs and _ negative-control siRNA 
(scrambled siRNA) were synthesized (Bioneer Co). 

For efficient target delivery to tissues in vivo, Anol siRNAs (10 ig each) were 
injected with polyethylenimine (in vivo jetPEI, Polyplus-transfection Inc). The 
siRNA-polyethylenimine complex (20011) was injected to the retro-orbital 
sinuses in the orbits of 7-week-old male ICR mice. 

To knockdown endogenous Xenopus laevis Anol (xAno1), endonuclease- 
digested siRNA (esiRNA) was prepared using a Silencer siRNA Construction 
Kit (Ambion). Long double-stranded RNAs corresponding to nucleotides 1205- 
1503 of xAnol (NM_001096548, gi: 148228306) were transcribed, purified and 
digested using RNase III and injected (1 ng per 50 nl water) to oocytes. Silencer 
Negative Control number 1 siRNA (19 bp scrambled sequence with 3’ dT over- 
hangs, Ambion) was used as a negative control. Gene silencing was confirmed by 
RT-PCR. 
Salivary secretion test. To measure the saliva volumes, anaesthetized mice 
(pentobarbital, 50 mg kg _', i.p.) were placed on a slight slope with heads facing 
downwards to avoid swallowing. A piece of pre-weighed cotton ball (~10 mg) 
was inserted below each tongue, and every 5 min the saliva-soaked cotton balls 
were removed, weighed, and then spun to extract saliva. Saliva osmolarities were 
measured using a Micro-Osmometer (Model 210, Fiske Associates), and Na‘, 
Cl” and K* concentrations in extracted saliva were determined using an 
Autobiochemistry Analyser (Hitach 7070). Mice that vomited after pilocarpine 
injection were excluded. 

Antibody production. The polyclonal antibody to mouse ANO1 was raised in a 
rabbit using a synthetic peptide affinity-purified by protein A-Sepharose column 
corresponding to the cytosolic region of ANO1 between TM2 and TM3 (451-466 
amino acid: KDHPRAEYEARVLEKS; AbFrontier Co). Anti-mouse ANO1 


nature 


polyclonal antibody was purified by protein A-Sepharose column chromato- 
graphy followed by antigen peptide-specific affinity column chromatography. 
The specificity of purified antibody was confirmed by immunoblot. 
Immunohistochemical staining of ANO1. Sections (4 um thickness) of 10% 
formalin-fixed, paraffin-embedded tissues were placed on silanized glass slides, 
deparaffinized three times with xylene, rehydrated in an alcohol series, and 
immersed in 0.3% HO . Sections were incubated with 1:800 diluted rabbit 
anti-mouse ANO1 polyclonal antibody followed by incubation with dextran 
polymer conjugated with goat anti-rabbit antibody and horseradish peroxidase 
(Envision plus kit, DAKO). The chromogen was developed over 5 min with 
liquid 3,3'-diaminobenzidine. The sections were counter-stained with Meyer’s 
haematoxylin. Controls with normal rabbit serum and with blocking peptide 
were processed in parallel. 

For immunofluorescent staining, frozen sections in 20-40 tm thickness of 

mouse submandibular glands were mounted on slide glass, fixed with methanol 
for 15min, washed, and incubated with rabbit ANOI polyclonal antibody 
(diluted 1:400, in blocking solution) and goat AQP5 polyclonal antibody 
(diluted 1:200, Santa Cruz Biotechnology) or goat NKCC1 polyclonal antibody 
(diluted 1:200, Santa Cruz Biotechnology) overnight at 4 °C. After wash, samples 
were incubated with Alexa Fluor 594-conjugated donkey anti-rabbit IgG 
(Molecular Probe, Inc., diluted 1:200) or Alexa Fluor 488-conjugated chicken 
anti-goat IgG (Molecular Probe, Inc., diluted 1:200) as a secondary antibody 
for Lh. 
Current recordings. To form gigaseals, individual cell surfaces were touched 
with a ~3 MQ Sylgard-coated glass pipette with gentle suction. The membrane 
in contact with the pipette was then ruptured by suction to form a whole cell. The 
junction potential was adjusted to OmV. For ion selectivity experiments, 
140 mM NMDG-C] in the control bath solution was replaced with equimolar 
NaCl, Nal, NaBr, NaNOs, NaF or Na-gluconate. For calculation of cation per- 
meability, the pipette solution contained 70 mM NaCl, 2mM MgCl, 70mM 
sucrose and 10mM HEPES (pH 7.2) and the bath solution contained 210 mM 
NaCl, 2mM MgCl, and 10 mM HEPES (pH 7.2). To determine current-voltage 
relationships, voltage ramps from —80 to +80mV were delivered for 350- 
400 ms to whole cells. Whole-cell currents were amplified using an Axopatch 
200B amplifier (Molecular Device) and stored in a personal computer after 
digitization by a Digidata 1440 (Molecular Device). 

For single-channel current recordings, the pipette and control bath solution 

contained 140mM NMDG-Cl, 2mM MgCl, and 10mM HEPES (pH7.2). 
Currents were amplified, filtered at 2 kHz, and stored in a data recorder, and 
when required these data were exported in a sampling rate of 100 KHz to a 
personal computer for current amplitude analysis. To create amplitude histo- 
grams, pClamp software (version 6.0, Molecular Device) was used to analyse 
open and closed single-channel current events. The half-amplitude algorithm 
was used to determine an open event. The minimum duration of an open event 
was set at 0.1 ms. 
Oocyte recording. pSDTF-mANOI was linearized using the BamHI restriction 
enzyme and transcribed with SP6 RNA polymerase (Ambion) to yield comple- 
mentary RNA. After defolliculation with collagenase, Xenopus laevis oocytes 
were injected with 50 ng of the complementary RNA of ANO1 in 50 nl of diethyl- 
pyrocarbonate-treated water. Two to five days after injection, whole-cell cur- 
rents were recorded using the two-electrode voltage-clamp technique. The 
bath solution containing (in mM) 96 NaCl, 1 MgCh, 5 HEPES, 2 KCl and 1.8 
CaCl, adjusted at pH 7.5 was perfused at 2 ml min” '. 
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Squeezing and entanglement in a Bose-Einstein 


condensate 


J. Estéve!, C. Gross', A. Weller’, S. Giovanazzi! & M. K. Oberthaler’ 


Entanglement, a key feature of quantum mechanics, is a resource 
that allows the improvement of precision measurements beyond 
the conventional bound attainable by classical means’. This results 
in the standard quantum limit, which is reached in today’s best 
available sensors of various quantities such as time” and posi- 
tion**. Many of these sensors are interferometers in which the 
standard quantum limit can be overcome by using quantum- 
entangled states (in particular spin squeezed states**) at the two 
input ports. Bose-Einstein condensates of ultracold atoms are 
considered good candidates to provide such states involving a 
large number of particles. Here we demonstrate spin squeezed 
states suitable for atomic interferometry by splitting a condensate 
into a few parts using a lattice potential. Site-resolved detection of 
the atoms allows the measurement of the atom number difference 
and relative phase, which are conjugate variables. The observed 
fluctuations imply entanglement between the particles’, a 
resource that would allow a precision gain of 3.8 dB over the stan- 
dard quantum limit for interferometric measurements. 

Spin squeezing was one of the first quantum strategies proposed to 
overcome the standard quantum limit, in a precision measurement”® 
that triggered many experiments'*'’. It applies to measurements 
where the final readout is done by counting the occupancy difference 
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between two quantum states, as in interferometry or in spectroscopy. 
The name ‘spin squeezing’ originates from the fact that the N part- 
icles used in the measurement can be described by a fictitious spin 
J = N/2. In an interferometric sequence, the spin undergoes a series 
of rotations in which one of the rotation angles is the phase shift to be 
measured. A sufficient criterion for the input state, allowing for 
quantum-enhanced metrology, is given by €s<1, where 
& =2JAJ2/(Ue)’ + Uy)”) is the squeezing parameter introduced in 
ref. 6. The fluctuations of the spin in one direction have to be reduced 
below shot noise (here 4 de <J/2), and the spin polarization in the 
orthogonal plane, (J,)” + ¢ ig has to be large enough to maintain the 
sensitivity of the interferometer. A pictorial representation of this 
condition is shown in Fig. 1b. The precision of such a quantum- 
enhanced measurement is €/\/N, whereas the standard quantum 
limit set by shot noise is 1//N. 

In this Letter, we report the observation of entangled squeezed 
states in a Bose-Einstein condensate of *’Rb atoms. The particles 
are distributed over a small number of lattice sites (between two 
and six) in a one-dimensional optical lattice (Fig. 1a). The occu- 
pation number per site ranges from 100 to 1,100 atoms. The two 
modes supporting the squeezing are two states of the external atomic 
motion corresponding to the condensate mean-field wavefunctions 
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Figure 1| Observing spin squeezing in a Bose-Einstein condensate 
confined in a double- or six-well trap. a, The atoms are trapped in an optical 
lattice potential superimposed on an harmonic dipole trap. The number of 
occupied sites is adjusted by changing the confinement in the lattice 
direction. High-resolution imaging allows us to resolve each site. b, Gain in 
quantum metrology is obtained for spin squeezed states exhibiting reduced 
fluctuations in one direction (z) and a sufficiently large polarization in the 
orthogonal plane (x, y) as depicted on the Bloch sphere. For our system, spin 
fluctuations in the z direction translate to atom number difference 
fluctuations An between two adjacent wells. The polarization of the spin in 


2an 
IN 


the x-y plane is proportional to the phase coherence, (cos ¢), between the 
wells. c, The atom number fluctuations at each site are measured by 
integrating the atomic density obtained from absorption images. We 
compare a typical histogram showing sub-Poissonian fluctuations in the 
atom number difference with the binomial distribution (red curve). The 
green curve corresponds to the deduced distribution after subtracting the 
photon shot noise, leading to a number squeezing factor of &, = —6.6 dB. 
d, The phase coherence is inferred from the interference patterns between 
adjacent wells. The histogram shown corresponds to a phase coherence of 
(cos ~) = 0.9. 
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in two adjacent lattice sites. These modes are spatially well separated 
and thus represent an ideal starting condition for a spatially split 
interferometer. The fictitious spin components can be defined as 
J. = (a'b + b'a)/2, Jy = i(a*b — bha)/2 and J, = (a'a — b'b)/2, where 
a‘ and b' denote the respective creation operators associated with the 
two modes, aand b denote the corresponding annihilation operators, 
and i=/—1. The zcomponent corresponds to half the atom num- 
ber difference between the wells. Because the mean occupation num- 
bers, 1, and n,, in the two wells are large, the expectation values of the 
xand y components can be approximated by (J,)~./7aMp(cos @) and 
Jy)~V/Nany(sin #), respectively, where ¢ is the phase difference 
between the two macroscopic wavefunctions. 

Spin squeezing by means of unitary evolution requires a nonlinear 
component in the Hamiltonian*; this is provided by the repulsive 
interactions between the atoms of the condensate. The correspond- 
ing suppression of atom number fluctuations in a Bose-Einstein 
condensate has been indirectly observed'***. However, by definition 
of the squeezing factor €s, its experimental determination requires 
access to the local properties of the atoms occupying the two sites of 
interest. By imaging the condensate with a resolution of 1 1m (full- 
width at half-maximum), which is well below the lattice spacing of 
5.7 um, we fulfil this criterion of local measurement. The wells of the 
lattice are fully resolved (Fig. 1a), which allows for the determination 
of the atom number in each lattice site by direct integration of the 
atomic density as obtained by absorption imaging. Local interference 
measurements after a condensate expansion time short enough that 
only neighbouring sites overlap reveal the phase between these wells. 
In Fig. 1c, d we display typical data sets for the two types of measure- 
ment. The technical details of the precise experimental procedures 
and calculations used to deduce the number squeezing factor and the 
phase coherence are given in the Supplementary Information. The 
fluctuation measurement of the two conjugate variables, number and 
phase, yields information about the quantum state of the system and, 
in particular, allows for the detection of macroscopic entanglement 
between the particles. 

Figure 2 summarizes the conjugate number—phase measurements 
in different experimental situations. The vertical axis corresponds to 
the number squeezing parameter, e =A RB /A Ie ref? Which measures 
how much the fluctuations of the atom number difference are sup- 
pressed in comparison with a binomial distribution with variance 
AJ peg =Natv/N, which is expected in a non-squeezed situation. The 
phase coherence, (cos #), between the two wells defines the horizontal 
axis. We choose the origin of the phase such that the J, component 
has a zero mean value. In this case, the relevant squeezing parameter 
for quantum metrology, €s, is given by €5 = Ey/(cos #). In the fol- 
lowing, we will refer to it as the coherent number squeezing para- 
meter. Lines corresponding to €;=—3dB and = —6dB are 
plotted in the figure. The different data points correspond to differ- 
ent preparations of the atoms in the lattice, as detailed in the legend of 
Fig. 3, and to different numbers of occupied sites. The simultaneous 
observation of number squeezing and high phase coherence (solid 
symbols) reveals the presence of coherent number squeezing. In the 
double-well and six-well situations, we deduce best squeezing factors 
of € = —2.3 dB and & = —3.8 dB, respectively. The statistical error 
bars and bounds for systematic errors are indicated in the figure. The 
open symbols show that atom number fluctuations can be further 
suppressed at the expense of lower phase coherence. In the six-well 
situation, we observe number squeezing down to & = —7.2 dB, 
which corresponds to fluctuations of 15 atoms per well, out of 
1,100. The inset in Fig. 2 shows the optimal number squeezing for 
a given phase coherence” in our experimental situation with 2,200 
atoms in a double well, revealing that there is still great potential for 
improvement. Our best measurements yield number fluctuations 
approximately 25 dB higher than the Heisenberg limited states with 
the same phase coherence. 

Entanglement in the context of spin squeezing has been intensively 
discussed’. In a first quantization approach, it can be defined as the 
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nonseparability of the N-body density matrix. With this definition, a 
sufficient criterion for entanglement coincides with the criterion for 
quantum metrology, namely €s < 1, which identifies spin-squeezing- 
type entanglement asa useful resource’. In the context of indistinguish- 
able bosons, as in our experiment, the number squeezing Ey < 1 has 
been shown to be a sufficient criterion for the nonseparability of the 
reduced two-body density matrix*”. This criterion is satisfied for all the 
measurements shown in Fig. 2. However, without the possibility of 
accessing the single-particle spin properties (by contrast to the case 
ofan ion string”), this type of bipartite entanglement may not be usable 
as a resource. For this reason, only measurements in the coherent- 
number-squeezed region (solid symbols) indicate the definite presence 
of useful entanglement. 

To identify what limits the amount of squeezing, we consider the 
two-mode Josephson Hamiltonian EcJ? /2 —25, Jx/N, that describes 
two weakly coupled condensates. The Josephson energy, Ej, and the 
charging energy, Ec, respectively characterize the tunnelling rate 
between the two condensates and the repulsive interaction energy 
inside each well. Because the depletion of the condensates in each 
well is small (=12 atoms) we neglect intrawell excitation. Longer- 
wavelength excitations that exist ina many-well situation are also not 
considered. For the purpose of identifying the limiting factors on 
squeezing, this simplified model suffices because it captures the cor- 
rect scaling of next-neighbour fluctuations with temperature, tun- 
nelling rate and interaction energy”®. In the two-mode model and at 
temperatures T high in comparison with the plasma energy, \/EcEj/ 
kg, the thermal excitation of the Josephson plasma mode limits the 
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Figure 2 | Number squeezing and phase coherence as a criterion for 
quantum metrology and entanglement. Measurements are shown for the 
two main well pairs of a six-well lattice (red and blue circles) and for a 
double-well potential (green diamonds). The total atom number, N, in each 
pair is approximately 2,200 in the six-well case and 1,600 in the double-well 
case. Filled and open symbols respectively represent two different 
experimental regimes corresponding to different final lattice depths (see 
Fig. 3). For filled symbols, number squeezing and high phase coherence are 
simultaneously observed, whereas for the open symbols the phase coherence 
is degraded. The limits for different quantum metrology gains € are plotted. 
In the coherent-number-squeezed region (below the orange line), directly 
usable entanglement is necessarily present in the system. The shaded areas 
show systematic error bounds due to a possible miscalibration of the atom 
number (+20%) and to an underestimation of the phase coherence caused 
by technical noise. The error bars indicate standard deviation deduced from 
at least 400 experimental realizations. The inset (same quantities as main 
panel) shows that our data are approximately 25 dB above the optimal 
allowed number squeezing (purple line), showing room for improvement”. 
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number squeezing to €,~kgT/ju, where ju is the chemical potential 
that measures the strength of the interaction (Ec ~ w/N) and kg is 
Boltzmann’s constant. In a typical experimental situation at thermal 
equilibrium, these fluctuations place a strong limit on both number 
and coherent number squeezing, as it is difficult to evaporatively cool 
the condensate much below the chemical potential. 

To circumvent this limitation, we prepare the condensate in a 
shallow lattice before increasing the lattice depth. During this split- 
ting process, the tunnelling rate, Ej, decreases from its initial value, 
EM , to a final value, EI . The interaction energy, Ec, stays almost 
constant in our experimental situation. For sufficiently slow ramp 
speeds, the evolution is expected to be adiabatic. Because the energy 
of the collective Josephson mode decreases during the splitting, 
energy is removed from the system, leading to an effective cooling 
of the relevant degree of freedom. In the regime where only the linear 
part of the collective spectrum is populated, the effective temperature 


evolves as Tere = T/ BE / BY leading to a final number squeezing 
Ong ona tea! Ty where ae =4kgT/N Ec is the initial squeezing 
at equilibrium with temperature T. The optimal number squeezing, 
assuming adiabatic evolution, is obtained by reducing further the 
Josephson energy to the Fock regime where the collective spectrum 
is quadratic. The final number squeezing is then given by 
EG i(keT/25)”). 

We investigate our assumption of adiabaticity by performing a 
series of experiments in the six-well situation. The lattice depth is 
ramped from the initial value at which the condensate is obtained 
(430 Hz) to a fixed end value (1,650 Hz) in different times. In Fig. 3a, 
the final number squeezing factors for the two most populated well pairs 
are plotted versus the total ramping time. As expected, no improvement 
in the number squeezing is observed for ramps that occur too quickly 
(<20 ms). For slower ramps, the number squeezing increases and 


saturates at —6.6*'6*?8 dB, where the given uncertainties are 
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lo statistical errors obtained from approximately 1,000 experi- 
mental realizations followed by bounding of the systematic error 
(see Supplementary Information). This improvement in number 
squeezing by a factor of approximately three is a consequence of the 
effective cooling discussed above. As shown in Fig. 3a, the observed 
behaviour is reproduced by a numerical simulation of the splitting 
process with the two-mode Josephson Hamiltonian. 

High phase coherence is the second ingredient for a coherent 
number squeezing factor 5 < 1. This requires that the effective tem- 
perature of the Josephson mode be below the tunnelling coupling E,/ 
kx, which is the case for the initial lattice depth”’. Increasing the lattice 
depth decreases the phase coherence. Best coherent number squeez- 
ing is obtained by adiabatically increasing the lattice depth to an 
optimal value such that coherence is still high and number squeezing 
has occurred. In order to find this optimum, we investigate the 
number squeezing and the phase coherence as a function of tunnel- 
ling coupling. We linearly ramp up the barrier height to different end 
values, keeping the ramp speed in the adiabatic regime. Figure 3 
presents the results obtained for six (Fig. 3b) and two (Fig. 3c) occu- 
pied wells. In the six-well case, we identify an optimum barrier height 
range between 650 and 900 Hz, where we deduce a best coherent 
number squeezing €; = —3.8*$-3*98 dB averaging over all the mea- 
surements. In the double-well situation, averaging all the points 
between 650 and 1,200Hz, we obtain €5= —2.31)2+°8 dB. The 
deduced mean phase coherence and number squeezing in the 
optimal regions correspond to the solid data points shown in Fig. 2. 

For the best observed number squeezing, we measure atom number 
fluctuations just above the detection threshold of our absorption 
imaging technique. To reach this sensitivity level, special care has to 
be taken to calibrate the deduced atom number” (see Supplementary 
Information). Furthermore, the contribution of the photon shot noise 
has to be precisely measured and subtracted”. As an independent 
check of the reliability of the atom counting, we monitor the evolution 
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Figure 3 | Systematics of number squeezing and phase coherence during 
the splitting of a condensate. a, Investigating the adiabaticity condition for 
the evolution of the many-body state during the splitting process. The 
number squeezing factors for the two most populated well pairs (blue and 
red circles) are measured after ramping up the lattice depth from the value at 
which the condensate is obtained (430 Hz) to a fixed end value (1,650 Hz), 
for anumber of different total ramp times (see inset). Numerical simulations 
using the two-mode Josephson model reproduce the general observed 
behaviour, assuming an initial thermal population of the density matrix 
corresponding to temperatures between 20 nK and 40 nK (grey shaded area). 
b, Number squeezing and phase coherence for different final lattice depths in 
the six-well case. The ramp speed is fixed to 4Hzms_' (300-ms ramp time in 
a) to satisfy the adiabaticity criterion. Number squeezing improves before 
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the phase coherence drops, leading to coherent number squeezing. The grey 
shaded area shows the predictions of the two-mode model assuming an 
adiabatic evolution of the density matrix with initial temperatures between 
10 nK and 30 nK (three to ten populated many-body states). c, Same as b but 
for a double well. The ramp speeds are 2Hzms | and 8Hzms ‘ for the 
triangles and diamonds, respectively. The symbols shown in Fig. 2 
correspond to the averages of all the data measured for barrier heights 
between 650 and 900 Hz (filled symbols) and above 1,300 Hz (open symbols) 
in the six-well case (b) and between 650 and 1,200 Hz (filled symbols) and 
above 1,400 Hz (open symbols) in the double-well case (c). Some data 
measured at very high lattice depth in the double-well case that participate in 
the averaging are not shown here. 
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Figure 4 | Random loss restores Poissonian fluctuations. With the atoms 
held in the lattice (six occupied sites), the atom number in the decoupled two 
main well pairs decreases as shown in the inset, owing to one- and three-body 
loss. Red circles show the evolution of number squeezing starting from a 
squeezed situation. The solid line is the prediction corresponding to the 
measured loss rates and initial squeezing. Blue diamonds show that in the 
absence of initial squeezing no further evolution is observed. The consistent 
behaviour of the squeezing with loss is an independent check of the validity 
of our squeezing measurement technique. A more quantitative test is 
performed by holding the atoms for 10s and measuring the number 
squeezing inferred from 1,000 realizations. The initial state is slightly 
squeezed (—3 dB < & <0 dB) and a squeezing factor 

—1.2dB < & <—14B (green shaded area) is expected at the time of 
measurement, when two-thirds of the atoms have been lost. We measure 
& = —0.7107 dB, which is consistent with the expected value within the 
95% statistical confidence bounds indicated by the error bar. The small 
improvement of number squeezing in the upper boundary of the green 
shaded area arises from the nonlinear dependence of three-body loss on 
atom number. After a sufficient time, the one-body loss dominates and 
Poissonian fluctuations are restored. 


of number squeezing with atom loss. It is well known in quan- 
tum optics that random loss processes rapidly degrade the number 
squeezing. Red circles in Fig. 4 show this restoring effect with a rate 
that is compatible with the measured one- and three-body loss rates. A 
further check is performed by monitoring the evolution of the number 
fluctuations for an initial state with almost Poissonian fluctuations; 
such a state is prepared by directly condensing the atoms in a deep 
lattice. No change in the number squeezing is observed through the 
loss process, within the statistical errors. During the measurement, the 
relevant parameters for the imaging, such as the extension of the cloud 
and its optical density, are kept constant. 

To confirm the successful implementation of a quantitative atom 
number fluctuation measurement, we infer number squeezing from 
1,000 measurements after a 10-s holding time, starting from a slightly 
squeezed situation (—3dB< & <0 dB). At this time, two-thirds of 
the atoms are lost and a precise number squeezing factor 
—1.2dB< & <~—14B is predicted from the measured one- and 
three-body loss coefficients. This exemplifies how particle losses 
can be used to prepare a well-defined number distribution in a con- 
densate. We measure &, = — 0.7107 dB, where the indicated uncer- 
tainties are 95% statistical confidence bounds, in quantitative 
agreement with the expected number squeezing. 

The measured squeezing presented here concerns the external 
degree of freedom of the atoms in the condensate. It is comparable 
in amount to the latest measured squeezing of internal atomic states, 
also obtained by unitary evolution of a nonlinear Hamiltonian'*. 
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We show that the achieved entanglement can be directly used as a 
resource for quantum metrology with spatial atom interferometers. 
This is a major step towards measuring at the ultimate, Heisenberg, 
limit with a large number of particles. 
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Marginal breakdown of the Fermi-liquid state on the 
border of metallic ferromagnetism 


R. P. Smith’, M. Sutherland’, G. G. Lonzarich’, S. S. Saxena’, N. Kimura’, S. Takashima’, M. Nohara® & H. Takagi>* 


For the past half century, our understanding of how the interactions 
between electrons affect the low-temperature properties of metals 
has been based on the Landau theory of a Fermi liquid’. In recent 
times, however, there have been an increasingly large number of 
examples in which the predictions of the Fermi-liquid theory appear 
to be violated’. Although the qualitative reasons for the breakdown 
are generally understood, the specific quantum states that replace 
the Fermi liquid remain in many cases unclear. Here we describe an 
example of such a breakdown where the non-Fermi-liquid prop- 
erties can be interpreted. We show that the thermal and electrical 
resistivities in high-purity samples of the d-electron metal ZrZn, at 
low temperatures have T and T°? temperature dependences, 
respectively: these are the signatures of the ‘marginal’ Fermi-liquid 
state*’, expected to arise from effective long-range spin-spin inter- 
actions in a metal on the border of metallic ferromagnetism in three 
dimensions**. The marginal Fermi liquid provides a link between 
the conventional Fermi liquid and more exotic non-Fermi-liquid 
states that are of growing interest in condensed matter physics. The 
idea of a marginal Fermi liquid has also arisen in other contexts— 
for example, in the phenomenology of the normal state of the copper 
oxide superconductors’, and in studies of relativistic plasmas and of 
nuclear matter**°. 

The low-temperature properties of metals are traditionally 
described in terms of the concept of elementary excitations. These 
excitations include the quanta of motion of the crystal lattice, or 
phonons, and the quanta of motion of the conduction electron sys- 
tem, or quasiparticle-quasihole pairs. The density of elementary 
excitations in thermal equilibrium decreases with decreasing temper- 
ature and vanishes as T— 0 K. If the interactions between these exci- 
tations are short range, we then expect the characteristic lifetime 
between collisions, t, to diverge with decreasing temperature: that 
is, we expect the scattering rate, 7 ', to vanish as T—> 0. A key prop- 
erty of the Fermi-liquid state is that the quasiparticle scattering rate 
vanishes specifically as the square of the temperature, that is, 7! « 
T°. This leads to a T° temperature dependence of both the thermal 
resistivity, w, and the electrical resistivity, 9, a behaviour character- 
istic of conventional metals at low temperatures. 

The Fermi-liquid state can break down if the quasiparticle inter- 
actions are attractive or if they are long range. In particular, the 
conventional (relativistic) current—current interaction between 
charge carriers remains long range in a normal metal and thus could, 
at least in principle, lead to a non-Fermi-liquid form of the relaxation 
rate t | at sufficiently low temperatures. For an ideally pure system 
in three dimensions (3D), the current—current interaction is expected 
to give rise to a quasiparticle scattering rate of the anomalous form 
t | x T log(T*/T) in the T—> 0 limit, where T* is a temperature 
scale°. This leads to a linear temperature dependence of the thermal 
resistivity, wo T, and a five-thirds power law for the electrical 


resistivity, p x T°? (refs 4-6). This is a crucial difference from the 
Fermi-liquid state, as it means that the characteristic uncertainty in 
the energy of a thermally excited quasiparticle (proportional to t” ') 
diverges in comparison with its characteristic energy (proportional 
to T). This effectively means that there are no well-defined fermionic 
excitations and that there is, evidently, a breakdown not only of the 
Fermi-liquid state but also potentially of the concept of elementary 
excitations itself, as it is traditionally understood. The state in which 
the ratio t '/T diverges logarithmically is the marginal case in which 
the quasiparticles cease to exist. 

This is an example ofa type of marginal Fermi liquid that arises as a 
result of long-range (non-local) effective quasiparticle 
interactions® °. The term ‘marginal Fermi liquid’ was first introduced 
to describe a different problem, involving local interactions in a 
model of the high-temperature copper oxide superconductors’. In 
both local and non-local marginal Fermi liquids, the thermal resistiv- 
ity, w, is linear in T at low T. However, crucially, the electrical resis- 
tivities differ in the two cases, namely, p varies as T and as T°? in the 
local and above non-local marginal Fermi liquids, respectively. We 
shall be concerned here solely with the non-local case in which the T 
dependences of w and p differ such that the ratio w/p does not 
saturate at low T, but diverges as T 7°. 

The current—current interaction is important in relativistic plas- 
mas, but can usually be ignored for normal metallic densities under 
achievable experimental conditions. A marginal Fermi-liquid state 
might arise at readily accessible conditions, on the other hand, owing 
to the effects of long-range spin-spin interactions in pure metals on 
the border of ferromagnetic order at low temperatures. The problem 
of long-range spin-spin interactions has been described by the self- 
consistent-renormalization (SCR) approximation® and _ related 
approximations**”, and also by the renormalization group tech- 
niques applied to quantum phase transitions and quantum critical 
phenomena”. The predictions of these approaches for the transport 
properties are qualitatively similar, and are summarized in Table 1 
for a 3D itinerant-electron system on the border of ferromagnetism. 

The SCR model reduces to the Fermi-liquid (FL) model below a 
temperature scale Tp,, which tends to vanish at the critical pressure or 
critical composition where ferromagnetic order disappears. Above 
Try but below another crossover temperature Typ (to be discussed 
below in a particular case), the SCR model reduces to the marginal 
Fermi liquid (MFL) if the ferromagnetic transition is continuous®”. 
The marginal Fermi-liquid regime has been difficult to identify un- 
ambiguously because (1) the transitions of interest are typically first 
order rather than continuous'’”’, (2) phonons can mask the intrinsic 
electronic state at intermediate temperatures (that is, 
Tp, < T< Twer) and (3) quenched disorder not included in the 
SCR model can play an important role in some of the key materials 
investigated thus far’>™’. 
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Table 1| Temperature dependences of transport properties 
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Transport property Fermi liquid Marginal Fermi liquid 
Electrical resistivity p=pot AT” Fi P=pot at 
Thermal resistivity, w= LoT/« w=WotAT~ + BT’ W=WotaT’?+bT 
Thermal minus electrical resistivities, 6 = w— p 6=BT? 0=bT 


Results are shown for a Fermi liquid and for a marginal Fermi liquid with long-range spin-spin interactions on the border of ferromagnetism>°"°, For a comparison of the temperature dependences of 
other physical properties, see, for example, refs 8, 9. The specific heat for the marginal Fermi liquid differs from that of a Fermi liquid by a correction factor that is logarithmic in T. In contrast to w and 
p, the correction is weak and difficult to separate from phonon and other contributions to the specific heat in materials such as ZrZnp. 


In this paper we consider an example in which these difficulties 
have been overcome, and in which the signatures of the marginal 
Fermi-liquid state in the transport properties (w and p) have been 
observed and interpreted in terms ofa semiquantitative model for the 
first time. 

The material selected for the search for the marginal Fermi-liquid 
state is ZrZn,, which crystallizes in the C15 cubic Laves structure. 
This compound is the best-known member of a small group of mar- 
ginally ferromagnetic metals in which the constituent elemental 
metals are not magnetic (in fact, Zr and Zn are superconductors at 
low temperature). The low-temperature magnetization M and the 
Curie temperature Tc of ZrZn are both an order of magnitude 
smaller than that of nickel, the archetypal itinerant-electron ferro- 
magnet, and the magnetic transition is continuous except very close 
to the critical pressure where ferromagnetic order disappears’. Also, 
Fermi surface measurements'® show that the ferromagnetism in 
ZrZn, can be understood in terms of itinerant electrons alone, and 
inelastic neutron scattering studies'”’* reveal that the spin-fluc- 
tuation spectrum is principally characterized by dissipative modes 
of a form expected in the SCR model. 

Importantly, ZrZn, can be purified to a sufficiently high level to 
allow us to unambiguously separate out the electronic and phonon 
contributions to w. This has thus far been the central stumbling block 
in establishing the crucial difference between w and p expected for a 
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Figure 1 | Evidence for the marginal Fermi-liquid state in ZrZn3. a, b, Main 
panels, the thermal conductivity (a) and electrical resistivity (b) of ZrZn2 

(blue curves; residual resistivity py = 0.31 uQ cm). Insets in a and b show the 
same properties over a wider temperature range, and also for a sample with 
higher po (red curves; 6.9 1.Q cm). In this figure and Fig. 3 the error for each 
point is believed to be smaller than the size of the data points. Comparison of 
these two samples allows us to demonstrate that the thermal conductivity of 
the clean sample is strongly dominated by conduction by electrons up to 

15 K (Supplementary Information, section 3). ¢, The difference between the 


marginal Fermi-liquid state in a nearly or weakly ferromagnetic 
metal. The details of the measurements of w and p, the separation 
of the electronic and phonon contributions and numerical calcula- 
tions of wand p in the SCR model are given in the Supplementary 
Information. 

The ambient pressure measurements of the transport properties in 
ahigh purity sample and comparisons with the predictions of the SCR 
model are presented in Fig. 1. Figure 1a and b shows the temperature 
variations of the thermal conductivity, «, and of p, respectively, for a 
sample with a residual resistivity of po = 0.31 whQcm. (This corre- 
sponds to a residual resistivity ratio 0(373 K)/po of close to 200 and 
an electronic mean free path of several thousand angstroms.) The inset 
of Fig. la shows x versus T for this sample and for a less clean sample 
over a wide temperature range. A comparison of these two samples 
(Supplementary Information, section 3) allows us to conclude that 
below 15K the thermal conductivity of the high-quality sample is 
overwhelmingly dominated by electrons. Figure 1c shows the differ- 
ence, 0, between the thermal resistivity, which we define using the 
Wiedemann Franz law to be w= Ip T/«, and p, plotted versus tem- 
perature. Here Ly is the Sommerfeld value of the Lorenz number 
(2.45 X 10° WQK “”). Figure 1d shows the temperature-dependent 
part of the resistivity, Ap = p — po, plotted in this case versus T°”. 

We see that 6 is linear in T and Ap is linear in T°’? at low tem- 
peratures. As shown in the insets of Fig. lc and d, the observed 
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thermal and the electrical resistivity (5) is linear with temperature, as 
predicted for the marginal Fermi liquid. d, The temperature-dependent part 
of the electrical resistivity plotted against T°’? also shows the marginal 
Fermi-liquid dependence (the straight lines in ¢ and d are guides to the eye). 
Insets in ¢ and d show the predictions of the SCR model for ZrZn, for three 
different values of the parameter k, (in A}; in both cases the values are 
scaled to the electrical resistivity at 15 K. Details of these calculations are 
described in Supplementary Information, section 4. 
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behaviour is consistent qualitatively and even semiquantitatively 
with the predictions of the SCR model for ZrZn, (see also 
Supplementary Information, section 4). The predictions of the model 
are relatively insensitive to the precise model parameters. The most 
sensitive parameter is the characteristic diameter, k., of the Fermi 
surface of the dominant charge carriers, and the value that gives the 
best quantitative agreement is in keeping with the known Fermi 
surface of ZrZn (Fig. lc inset). 

The SCR model also predicts the orders of magnitude of Tpy, Typ. 
and Tc. The calculations of these quantities versus normalized pres- 
sure (p/p., where p, is the critical pressure where ferromagnetism is 
suppressed completely) for ZrZn, are shown in Fig. 2. Just below p,, 
where Typy is small and the phonon contribution to transport can be 
ignored in the marginal Fermi-liquid regime, we find a T°” variation 
of the electrical resistivity both below and above Tc, with an abrupt 
change in slope at Tc (Fig. 3). As shown in Fig. 3 inset, this behaviour 
is again consistent with the prediction of the SCR model. We observe 
an electrical resistivity exponent of approximately 5/3 at low T over 
the entire pressure range up to p, (Fig. 4). The sudden drop in the 
exponent at higher pressures (where the magnetic transition becomes 
first order'’) and beyond (p = p,) will be discussed below. 

Our findings thus provide strong evidence for the existence of 
marginal Fermi-liquid behaviour in ZrZn,. The new results are that 
(1) the thermal resistivity is linear in temperature and that this beha- 
viour is consistent with a T”? temperature dependence of the res- 
istivity within the marginal Fermi-liquid model, and (2) that the 
temperature dependences of the thermal and electrical resistivities 
are in agreement with the predictions of the SCR theory. This theory 
reduces to the theory of a marginal Fermi liquid over large portions of 
the temperature—pressure phase diagram of ZrZn). The signature of 
the marginal Fermi-liquid model should also appear in the temper- 
ature dependence of the specific heat and of the magnetic equation of 
state. The marginal Fermi-liquid form of the specific heat differs from 
that of a Fermi liquid by a factor logarithmic in T, which is weak and 
difficult to separate reliably from the phonon contribution in ZrZn, 
in the regime of interest, Tp,p<T< Typ. (ref. 19; see also 
Supplementary Information). This difficulty extends to a number 
of other d-electron systems, including the optimally doped copper 
oxides in which the predicted logarithmic correction to the specific 
heat is obscured by phonons and other effects’. The signature of the 
marginal Fermi-liquid state has also been difficult to identify in the 
magnetic equation of state for two reasons. These are, first, the 
extreme sensitivity of the uniform magnetization to magnetic inho- 
mogeneities and, second, the complex form of the free energy as a 
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Figure 2 | Predicted phase diagram of ZrZnz. Main panel, the solid line 
represents the Curie temperature T¢, and the two dashed lines correspond to 
crossover temperatures. Below the lower dashed line (Tp) the SCR model 
reduces to the Fermi-liquid model, whereas between the lower dashed line 
and upper dashed line (Tyyp,) the SCR model reduces essentially to a 
marginal Fermi-liquid (MFL) model (except in a narrow region near Tc). 
Inset, the same diagram with a logarithmic temperature scale. Ty, is defined 
by the resistivity temperature exponent dropping to 1.85, and Tyypr is 
defined as where the resistivity exponent drops to 1.55. 


1222 


NATURE|Vol 455|30 October 2008 


0.6 
5 0.4 
g 0.8 
a ea Calculation 
+ © 
0.2 x 
> 
a 
0 
, 0 20 40 60 80 
0 20 40 60 80 100 


T8/3 (K5/3) 


Figure 3 | Resistivity of ZrZn, just below the critical pressure. Main panel, 
the measured resistivity of ZrZn, at 15.2 kbar plotted against T°’, showing 
the T°? behaviour of the resistivity both below and above Tc but with a 
different slope. Inset, the SCR model prediction, which shows excellent 
qualitative (and semiquantitative; see Supplementary Information) 
agreement. The straight lines in these figures are guides to the eye. 


function of the uniform magnetization, which precludes a simple 
expansion of the magnetic equation of state as a power series in the 
magnetization’*”®. Crucially, these and other previously mentioned 
difficulties are absent or less severe for the case of the transport 
properties, which have therefore been the focus of our attention. 
We now consider other evidence of marginal Fermi-liquid beha- 
viour in weakly ferromagnetic metals. First, a 5/3 power law for the 
temperature dependence of the electrical resistivity is not rare in such 
systems, and has been seen recently not only in ZrZn, (refs 19, 21) but 
also, for example, in Pd, —,.Ni, (ref. 13) and Ni;Al (ref. 22). Second, a 
logarithmic correction to the specific heat has been identified in 
Pd, _,Ni, (ref. 13) and NbFe, (ref. 23). (Such a logarithmic correc- 
tion is also ubiquitous in f-electron metals near various subtle kinds 
of quantum critical points®.) The observation of both a T°” electrical 
resistivity and T In(T*/T) specific heat in Pd,—,Ni,, in particular, is 
compelling evidence for the existence of the marginal Fermi-liquid 
state of the non-local kind we are considering here. However, because 
Pd,_,Ni, is intrinsically disordered, a clear separation of the elec- 
tronic and phonon contributions to the thermal resistivity that 
would help to confirm the marginal Fermi-liquid scenario would 
seem to be ruled out in this system. We also note that a quantitative 
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Figure 4 | Pressure dependence of T; and the resistivity exponent in ZrZnp. 
Main panel, the pressure dependence of the Curie temperature T¢ as inferred 
from the kink in the resistivity. Error bars on pressure show the width of the 
superconducting transition of tin used to determine the pressure; error bars 
on temperature show the difference between several methods of determining 
the temperature of the kink in the resistivity. Upper inset, the temperature 
exponent of the resistivity (in the range 1-5 K) versus pressure, showing the 
abrupt change from around 5/3 to 3/2 at around 18 kbar. Error bars on X are 
determined from the uncertainty in the power-law fit to the data. Lower 
inset, the cubic Laves structure of ZrZn). 
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description of Pd,_..Ni, along the lines given here for ZrZn, has not 
yet been carried out to our knowledge, and may not be possible 
within the SCR model in its present form™. Despite these difficulties, 
the findings in Pd,_,.Ni, and other low- Tc. ferromagnetic metals in 
general provide further support for the idea that the marginal Fermi- 
liquid state can exist over at least some portion of the temperature— 
pressure phase diagram in the weakly ferromagnetic regime (p < p,), 
when the magnetic transition is continuous or not too strongly first 
order. 

Finally, we comment briefly on the curious drop in the electrical 
resistivity exponent from the marginal Fermi-liquid value at low pres- 
sures to a lower value of approximately 3/2 at higher pressures—that 
is, at pressures where the magnetic transition becomes first order and 
beyond (Fig. 4 inset). This surprising behaviour, which was reported 
in an earlier study by some of us”, is inconsistent with the predictions 
of the SCR model and remains a mystery that ZrZn, shares with other 
materials, including MnSi (refs 11, 24, 25) and PrNiOs (ref. 26). One 
possibility is that the state of these systems close to and beyond p, (that 
is, in the first-order regime and above) is characterized by slow mag- 
netic inhomogeneities not correctly described using the SCR model in 
its present form”’. Recent treatments that go beyond the SCR approxi- 
mation indeed suggest that magnetic inhomogeneities might arise 
generically near a ferromagnetic quantum phase transition 
(p = p.)***. Furthermore, energy-band structure calculations” sug- 
gest, seemingly paradoxically, that antiferromagnetic correlations can 
be pronounced in nearly ferromagnetic metals similar to ZrZn, at 
P = Po and can strongly modify the temperature dependence of the 
resistivity above p, from that expected in a marginal Fermi-liquid 
description. 
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Low-speed fracture instabilities in a brittle crystal 


J. R. Kermode’, T. Albaret”, D. Sherman’, N. Bernstein’, P. Gumbsch”®, M. C. Payne’, G. Csanyi’ & A. De Vita®? 


When a brittle material is loaded to the limit of its strength, it fails 
by the nucleation and propagation of a crack'. The conditions for 
crack propagation are created by stress concentration in the region 
of the crack tip and depend on macroscopic parameters such as the 
geometry and dimensions of the specimen’. The way the crack 
propagates, however, is entirely determined by atomic-scale phe- 
nomena, because brittle crack tips are atomically sharp and prop- 
agate by breaking the variously oriented interatomic bonds, one at 
a time, at each point of the moving crack front'*. The physical 
interplay of multiple length scales makes brittle fracture a complex 
‘multi-scale’ phenomenon. Several intermediate scales may arise 
in more complex situations, for example in the presence of micro- 
defects or grain boundaries. The occurrence of various instabilities 
in crack propagation at very high speeds is well known’, and sig- 
nificant advances have been made recently in understanding their 
origin**. Here we investigate low-speed propagation instabilities 
in silicon using quantum-mechanical hybrid, multi-scale model- 
ling and single-crystal fracture experiments. Our simulations pre- 
dict a crack-tip reconstruction that makes low-speed crack pro- 
pagation unstable on the (111) cleavage plane, which is con- 
ventionally thought of as the most stable cleavage plane. We per- 
form experiments in which this instability is observed at a range of 
low speeds, using an experimental technique designed for the 
investigation of fracture under low tensile loads. Further simula- 
tions explain why, conversely, at moderately high speeds crack 
propagation on the (110) cleavage plane becomes unstable and 
deflects onto (111) planes, as previously observed experimentally®”’. 

Until relatively recently, the process of brittle fracture has been 
primarily analysed using continuum mechanics techniques, which 
predict the driving force for crack propagation—the release of elastic 
energy per unit advancement as the crack progresses—to be a func- 
tion of the applied load, the length of the crack and a geometry- 
dependent factor'’**°. The crack is assumed to propagate as long as 
its energy release rate, G, is equal to or larger than a material- and 
orientation-dependent critical value. Griffith®, who first carried out 
this analysis, equated the critical energy release rate below which 
crack propagation cannot take place with 2y, the energy needed to 
create the new crack surfaces. This approach provides a good descrip- 
tion of the fracture process for a straight crack in both static® and 
dynamic regimes’, as shown by recent experiments on carefully pre- 
pared silicon single crystals”'°’. 

More delicate issues such as the origin of crack path instabilities 
and materials’ resistance to fracture are, however, poorly understood. 
Even very simple atomistic models show that the discrete nature of 
the material will manifest itself in a higher resistance to fracture than 
predicted by Griffith’s criterion, owing to lattice (or bond) trap- 
ping*"*. This effect depends on the detailed nature of the atomic 
interactions in the specific material*'®, and may be very different 


for different cleavage planes. For a given cleavage plane, it can lead 
to slow crack growth'*"* and anisotropy with respect to propagation 
along different crystallographic directions'”'* or even the sign of the 
propagation direction”. 

Our simulations use the ‘learn-on-the-fly’ (LOTF) hybrid clas- 
sical/quantum mechanical molecular dynamics method, which pro- 
vides a nearly seamless coupling of the mobile quantum mechanical 
and the classical regions”. We use the quantum mechanical chem- 
ical-bonding description, where such coupling is necessary near the 
crack tip. In the rest of the sample it is unnecessary and would be 
prohibitively expensive computationally. We first consider a (111) 
crack plane with the crack front aligned with the [110] direction, 
which is under a uniaxial [111] tensile load (Fig. 1). Our simulations 
indicate that the K-field displacement predicted by continuum the- 
ory’” is never stable. Below a load of G= 2.1 Jm ”, which is much 
lower than the Griffith critical load (G. = 2y = 3.1 Jm’, using a 
value for the surface energy computed with our quantum mechanical 
model), the crack closes. Above this load, a crack-tip reconstruction 
forms spontaneously by means of a bond rotation (Fig. 1c), resulting 
in the formation of five- and seven-member rings (see Supple- 
mentary Information). This relieves the stress along the main load 
axis in the immediate tip region, thus inducing additional lattice 
trapping. The reconstructed geometry of the crack tip is structurally 
similar to the Pandey reconstruction of the (111) surface’. As such, it 
could be viewed as the core of a dislocation that remains trapped at 
the crack tip, unable to induce large-scale plasticity at temperatures 
much lower than the brittle-to-ductile transition. 

In simulations performed above the critical load for propagation, 
the slowly moving (111)[110] crack occasionally visits the recon- 
structed tip configuration as it propagates. Remarkably, for slow 
enough propagation, the [111] oriented bonds in the six-member 
rings located one atomic layer below the initial crack plane and 
further along the propagation direction can break before the bridging 
bonds associated with the seven-member ring. Indeed, these are the 
bonds where the stress is mostly concentrated after the reconstruc- 
tion. Every such reconstruction and bond-breaking event creates a 
downward [111] step for the system oriented as in Fig. 1c, owing to 
the broken mirror symmetry of the crystal. This is at variance with the 
behaviour expected for propagation at high speeds (larger than those 
investigated here), where both upwards and downwards steps may 
occur and macroscopically smooth (111) cleavage surfaces are 
invariably observed’. Our simulations thus predict a systematic 
deflection of the crack path from the (111) plane at sufficiently low 
velocities, whereas fast cracks are dynamically steered away from the 
instability. We speculate that the deflection process may start with a 
reconstruction event induced by the local slowing of a crack-front 
segment, perhaps due to an encounter with a crystal defect. However 
it starts, this process will initiate a positive-feedback ‘sinking’ 
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mechanism, because each reconstruction event will further trap and 
slow the crack, leading to ever more reconstructions, the whole pro- 
cess yielding observable fracture surface features. 

A simple mesoscopic model of the crack-tip reconstruction and 
ensuing feedback process shows how this perturbation can evolve 
into the macroscopic ridge-like feature (Fig. 2d). A crack-front sec- 
tion of length L is modelled by an elastic string”, represented by a set 
of N beads, each of mass m, connected by springs. The beads’ posi- 
tions and velocities are represented by the array of dynamical vari- 
ables {r;, v;}, i= 1,...,N, and periodic boundary conditions are 
imposed in the crack-front y direction. The values of most of the 
parameters in what follows can be directly derived from experimental 
data, as detailed in the Supplementary Information. The equations of 
motion for the beads are expressed as the following dynamical map, 
where At is the time step associated with each iteration of the map: 


r(t+At)=r,(t)+Atv((t) 


vi(t+At)=v;(t)4 at [fo% — bv,(t) + k(n; 4 ,(¢) + ri-1() —2r((t))| 


6 70M OMEN GY manOg7 — DR (1) 


s 
~ 1 +exp((v — vg) /d) 
Consider the terms on the right-hand side of equation (1), in the 


order they appear. The model incorporates crack-front acceleration 
in the propagation direction x due to the driving force f°X induced by 


ci (2) 


Figure 1| The simulated (111)[110] crack system. a, Thin-strip geometry of 
the simulation. b, Map of the o,,, stress component near the crack tip (red, 
highly stressed; black, unstressed). ¢, Ground-state reconstruction geometry 
of the crack tip under load, created by a rotation of the bond indicated by the 
arrow. Red atoms are described using quantum mechanics, yellow atoms 
using an interatomic potential. The small white atoms represent hydrogen 
atoms that we used to terminate the initial surface behind the crack to 


prevent it from closing up at low loads, and thereby ease the search for a load 
value that stabilizes the crack (with or without terminating hydrogen 
atoms). The first few bonds that break after the reconstruction are in six- 
member rings adjacent to the seven-member ring of the reconstruction. 
Inset, crack-tip structure after reconstruction and subsequent bond 
breaking. Further increasing the load finally breaks the crack-tip bridging 
bond and advances the open crack surface, yielding a downward step in the 
crack plane (not shown). 
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the load G. This continues until a limiting speed Vjnax = f °/b, due to a 
velocity-dependent phonon drag force —bv,(t) with friction coef- 
ficient b, is reached. Line tension along the crack front is modelled 
by the springs of strength k. Crack-tip reconstruction occurs at a rate 
which smoothly rises for decreasing crack speeds, leading to the crack 
climbing by one atomic layer after each reconstruction and slowing 
the crack due to reconstruction-induced lattice trapping. This is the 
key assumption of the model and yields the drag term quadratic in 
the velocity, which is connected with equation (2) as discussed in 
detail below. Last, there is a driving force increase for crack-front 
sections ‘held back’ during the propagation, to include stress- 
enhancement effects at those segments of the crack that have slowed 
and trail behind the rest of the crack front. This leads to extra driving 
forces for these segments, which eventually level the ridges. Although 
the last effect is well known, its precise functional formulation and 
scaling are not. However, the model is not very sensitive to the details 
of the stress-enhancement term, so varying its functional form and 
prefactor 17 gives rise to only slightly different ridge shapes. 

The variable € in equation (2) (dropping the subscript) is the local 
climb slope due to reconstruction, which is velocity dependent. The 
exact functional form of € is again not particularly important, as long 
as it implies no reconstruction at high velocities (where, thus, € ~ 0) 
and a smooth transition to the maximum observed climb slope, s, at 
low velocities. A simple exponential form with scale parameter d that 
centres the transition on a chosen value vp < Vjnax is used in equation 
(2), which thus just expresses the fact that climbing results from low 
propagation speeds, in agreement with our calculations and experi- 
ments. A macroscopic propagation instability can arise if low pro- 
pagation speeds can, in turn, be caused by climbing, creating a 
positive-feedback loop, when a perturbation initiates the whole pro- 
cess by slowing a point of the crack front below a critical speed 
Ve Ver dees 

Our simulations provide an atomic-scale mechanism leading to 
such a situation, by showing that climbing is associated with tip 
reconstruction events, promoted by low propagation speeds and, 
in turn, capable of momentarily trapping, and thus locally further 
slowing, the crack front. In our model we represent this deceleration 
by imparting to the system a small impulse, directed opposite to the 


100 um 


Figure 2 | Ridges formed by low-speed instabilities on the (111) crack 
plane. a, Optical micrograph of the (111) crack system. The crack advances 
from left to right as it accelerates up to a speed of about 800ms_‘, as 
determined from the faintly visible Wallner lines**. The ridges start to form 
at a variety of crack speeds. b, The fracture apparatus, showing the silicon 
specimen glued to the aluminium frame. ¢, Atomic force micrograph close- 
up ofa ridge. d, Evolution of the crack front with the ridge shape predicted by 
a mesoscopic model based on the atomistic reconstruction mechanism 
illustrated in Fig. 1. The red lines represent snapshots of the propagating 
crack front yielding the interpolated crack surface (grey). 
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direction of motion, associated with each reconstruction event. If we 
assume for simplicity that an average of n reconstruction events take 
place during the time interval At for a given bead, its momentum mv 
will drop by — nmvt/At, where the parameter t < Aris representative 
of the time delay associated with each reconstruction. The number of 
events is related to the propagation velocity by n = €vAt/I, where Lis 
the bilayer distance in the (111) direction, that is, the amount of 
climb for each reconstruction event. Thus, the momentum change 
is —mévt/l, which is the nonlinear drag term in equation (1) dis- 
cussed above. We note that if the frequency of reconstruction events 
were constant for a given temperature, as is the case for simple acti- 
vated processes, then € would scale as 1/v, and would just give rise to a 
simple drag term, linear in the velocity. Thus, in an indirect way, this 
model sheds light on the role of the cooperative, dynamic nature of 
brittle fracture: at high speeds, the atoms near the crack tip are steered 
towards clean cleavage. 

We perform experimental studies of the low-crack-speed regime 
using a technique for applying very small but steady and well-con- 
trolled tensile loads. A silicon specimen is loaded by taking advantage 
of the thermal expansivity mismatch between the sample and the 
aluminium loading frame (Fig. 2b). Micrographs of the resulting 
(111) fracture surface are shown in Fig. 2a, c. Triangular ridges, all 
deviating in the same direction from the fracture surface, form at a 
range of low crack speeds below about 800ms'. At higher crack 
speeds, of about 2,000 ms ', the surface is mirror smooth and no 
ridges are present. The crystallographic direction of the deviation 
(identical in over 40 independent samples) is the same as the recon- 
struction-induced steps in the atomistic simulation, and the shape of 
the ridges is qualitatively in agreement with the mesoscopic model 
(Fig. 2c, d). Similar features have recently been reported’? under 
more complicated loading conditions at a speed of about 
1,000 ms_ | (see Supplementary Information). 

We next considered the (110) crack plane. Experiments have 
shown that (110)[110] cracks propagating along the [001] direction 
deflect out of the plane at very low velocity®'*, whereas (110)[001] 
cracks propagating in the [110] direction stay on the (110) plane up to 
very high velocity (about 2,900 ms__') before also faceting onto (111) 
planes®’. Recent studies* have assigned critical velocities for this 
instability for various propagation directions and have shown that 
in all cases the deflection is not immediate but only occurs after some 
initial propagation on the (110) plane. We simulated the (110)[110] 
crack propagation and observed the onset of this deflection. This 
crack propagates by breaking a series of bonds, labelled A in Fig. 3a. 
The system also contains type-B bonds, which are oriented at the 
same angle with respect to the tension axis as A bonds. Resolving the 
elastic stresses on the atomic scale reveals that B bonds located imme- 
diately above and below the crack plane are almost as highly stressed 
as A bonds. However, strong neighbour bonds connecting under- 
coordinated atoms left exposed by the advancing crack make the A 
bonds weaker than B bonds, so the former are expected to break 
selectively during quasi-static (110) cleavage, consistent with the 
low-speed experimental observations. 

In the simulations we initially observe each newly exposed under- 
coordinated surface atom snap back towards the subsurface region. 
This induces significant local atomic motion and the excess energy 
diffuses into the bulk crystal, but no immediate rebonding occurs as 
long as the speed of propagation is sufficiently low. However, bond- 
breaking events become more frequent with increasing crack speed, 
and the local relaxations overlap in time. As more kinetic energy is 
locally available, the fast crack front ‘stumbles’. Local reconstructions 
involving the removal of under-coordinated atoms begin to occur on 
the open surfaces. This removes the reason for selective A-bond 
breaking and in our simulation we indeed observe B-bond breaking 
events that deflect the crack front onto a (111) plane. Further simula- 
tions reveal that under these conditions any slight disturbance away 
from pure tension (for example a small extra shear strain component 
as in Fig. 3b) can systematically reverse the initial relative stability of 
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Aand B bonds. This results in multiple coherent breaking of B bonds, 
effectively exposing (111) surfaces, consistent with the crack motion 
observed experimentally (Fig. 3c—e). 

No near degeneracy of crack-tip bonds exists in the orthogonal 
[110] direction, consistent with the observation of instabilities only 
at much higher speeds*®. The deflection mechanism is thus only 


Figure 3 | The (110)[110] crack system. a, Geometry of the crack tip 
propagating straight at low speeds by sequential breaking of type A bonds 
(blue). Red atoms are described using quantum mechanics, yellow atoms 
using an interatomic potential. At each propagation step, the A bond is 
weaker than the corresponding B bond because of a neighbouring under- 
coordinated atom (indicated by an arrow for the A,—B, pair). b, At higher 
speed an instability occurs. At this point any slight shear disturbance in the 
stress field reverses the relative stability of A and B bonds, and the crack is 
deflected onto a (111) plane. The energy release rate for this system is 
G=6.7J m ~. In addition to the main tensile load, this includes a small 

G = 0.24Jm ® shear contribution to break the symmetry in the y direction. 
Both exposed (111) crack surfaces undergo a 2 X 1 reconstruction. 

c-e, Photographs from three-point bending experiments® (reprinted with 
permission): (110) fracture surface for low-speed cleavage (grey; c); fracture 
surface of a specimen with the crack deflected from the (110) plane (white) to 
the (111) plane (black) for intermediate-speed cleavage (d); (111) fracture 
surface obtained for high-speed cleavage (e). 
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operative for the [001] crack propagation direction. By contrast with 
the (111) cleavage plane, here the formation of new chemical bonds, 
eventually leading to crack deflection, is favoured by a relatively high 
propagation speed rather than a low one. This is due to the different 
interplay between the local atomic motion and the ‘lattice trapping 
energy liberated by the cleavage during the time intervals between 
consecutive crack-tip advancement events. 

For both cleavage planes in silicon, subtle atomistic details of the 
crack tip involving dynamical reconstruction mechanisms govern the 
path of the crack. This is unlikely to be the case for silicon only, owing to 
the general multi-scale character of the fracture processes. In the 
Supplementary Information we present data on preliminary simula- 
tions that indicate similar crack-tip reconstruction phenomena in dia- 
mond and silicon carbide, and on experiments on sapphire that show 
low-speed instabilities similar to those observed in silicon. Therefore, 
we suggest that crack-tip reconstructions occur more generally in brittle 
solids and control the qualitative macroscopic fracture behaviour. 


METHODS SUMMARY 


Tensile loading of the silicon (111) system was performed using apparatus shown 
in Fig. 2b. The load is applied by heating the whole set-up by a few degrees 
Celsius. The experimental setup for the (110) system was identical to that in 
ref. 6. The LOTF hybrid simulations were carried out using the Stillinger-Weber 
classical potential’? and a quantum mechanical model based on density func- 
tional theory, as implemented in the CASTEP”’ and SIESTA*® packages. The 
quantum mechanically treated region was selected automatically during the 
simulation, on the basis of geometric criteria in relation to the instantaneous 
crack-tip position. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


For the (111) system, cleaving of 525-jtm-thick silicon specimens of 42 X 26- 
mm lateral dimensions, cut from a [110] 4-inch silicon wafer, was performed 
under tensile load using the apparatus shown in Fig. 2b. The specimens were 
notched with a 150-j1m-thick diamond saw to a length of 1.5mm, heated to 
200 °C and quenched in water to generate a pre-crack of the required length. The 
specimens were then glued to an A17075 T6 frame using a 150-tm-thick film of 
Epon 815C epoxy resin. The load was applied by heating the whole set-up by a 
few degrees Celsius on a laboratory heating stage, using the fact that the thermal 
expansion coefficients of the sample and the loading frame are different. When 
the rate of heating was well below 1°C min ', the instabilities were always 
present. Using high rates of heating resulted in the propagation of fast cracks 
leaving behind mirror-smooth surfaces. 

In the computer simulations, the CASTEP” package was used for the (111) 
system and the SIESTA” package for the (110) system. The classical potential was 
adjusted to match the linear elastic properties of the quantum mechanical mod- 
els in each case. The simulations were carried out in the plane strain configura- 
tion and free boundary conditions were used in the x direction. The load, applied 
by imposing a fixed-displacement boundary condition in the y direction, was 
increased in a number of steps starting from G= 2.5J m *, allowing at least 1-ps 
equilibration time between successive steps, until the crack started to propagate. 
A variety of strain rates were tested down to 0.1% ps ', with no qualitative 
difference observed. The load needed to crack the system was 6.0J m ” at the 
lowest strain rate we could achieve. Because the load was found still to be strongly 
strain rate dependent even at this strain rate, this value is only an upper bound. 

The constant temperature (300 K) simulations for the system shown in Fig. 1 
were made on a 1200 X 400 * 3.8-A°, 90,868-atom model thin slab, periodic in 
the crack-front direction. For these low-temperature, highly driven systems, 
thermal activation and entropy effects, which could induce a qualitatively dif- 
ferent behaviour in three-dimensional systems, are unlikely to have a significant 
role. A series of quasi-static simulations show the same behaviour, although the 
system has to be overloaded much more in order to crack, as expected owing to 
the high lattice trapping. 

In the plane-wave density functional theory part of the LOTF hybrid simu- 
lation, the Brillouin zone was sampled with four points along the crack front, 
exchange and correlation were described using the Perdew—Burke—Ernzerhof 
functional, nuclei were described by ultrasoft pseudopotentials, and the elec- 
tronic wavefunctions were expanded in plane waves with a 180-eV kinetic energy 
cut-off. The quantum mechanical region was selected using an algorithm 
detailed in the Supplementary Information. The buffer region of the LOTF 
scheme” was set at about 8A (fourth-neighbour distance). 

Furthermore, we made a test calculation on a 15.2-A-thick slab using a tight 
binding” quantum mechanical engine to verify that the same phenomena apply 
in the true three-dimensional case. The reconstruction occurred at all sites along 
the crack front in quick succession before any other kind of rebonding took 
place. The simulation of the system shown in Fig. 3 was made on a 
687 X 655 X 7.6-A°, 174,752-atom model slab, using the standard single zeta 
plus polarization basis set with soft confinement for the local orbital density 
functional theory part of the calculation. The mesh cut-off was 50Ry 
(1 Ry = 13.6 eV) and norm-conserving pseudopotentials were used. 
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Medieval forewarning of the 2004 Indian Ocean 


tsunami in Thailand 


Kruawun Jankaew’, Brian F. Atwater’, Yuki Sawai’, Montri Choowong’, Thasinee Charoentitirat', Maria E. Martin* 


& Amy Prendergast? 


Recent centuries provide no precedent for the 2004 Indian Ocean 
tsunami, either on the coasts it devastated or within its source area. 
The tsunami claimed nearly all of its victims on shores that had 
gone 200 years or more without a tsunami disaster’. The associated 
earthquake of magnitude 9.2 defied a Sumatra—Andaman cata- 
logue that contains no nineteenth-century or twentieth-century 
earthquake larger than magnitude 7.9 (ref. 2). The tsunami and 
the earthquake together resulted from a fault rupture 1,500 km 
long that expended centuries’ worth of plate convergence”. 
Here, using sedimentary evidence for tsunamis®, we identify pro- 
bable precedents for the 2004 tsunami at a grassy beach-ridge plain 
125 km north of Phuket. The 2004 tsunami, running 2 km across 
this plain, coated the ridges and intervening swales with a sheet of 
sand commonly 5-20cm thick. The peaty soils of two marshy 
swales preserve the remains of several earlier sand sheets less than 
2,800 years old. If responsible for the youngest of these pre-2004 
sand sheets, the most recent full-size predecessor to the 2004 tsu- 
nami occurred about 550-700 years ago. 

The 2004 Indian Ocean tsunami, cresting higher in Thailand than 
it did anywhere else east of Sumatra (Fig. 1b), rose as much as 20m 
above sea level on Phra Thong Island’. The main wave, as observed 
from a hill on the island’s western shore (H, Fig. lc), formed a 
relentless flood that rose stepwise to heights above treetops®. The 
tsunami ran more than 2 km inland across a Holocene plain com- 
posed of grassy beach ridges and intervening tree-lined swales. 

The flooding at Phra Thong, as on other Thai coastal plains”, 
produced local erosion and widespread deposition. The tsunami 
reamed out several drainages previously cut across beach ridges as 
much as 300m inland (Fig. lc, e, and Supplementary Fig. 1). In 
addition, it coated most of the island’s western half with a sheet of 
sand (Fig. 1d). This sheet is locally lineated with sand streaks that 
extend inland from the spoil piles of tin miners (Fig. le and 
Supplementary Fig. 1). Its mean particle size ranges from medium 
to very fine on a transect across the northern part of the island". 
Horizontal bedding is common, as is overall upward fining to coarse 
silt. 

Building on a previous reconnaissance"’, we sought pre-2004 sand 
sheets at Phra Thong by digging pits and augering holes into ridges 
and swales at more than 150 sites (Fig. 1d (dots) and Supplementary 
Table 1). At 20 of these sites we found pre-2004 sand interbedded 
with the peaty soils of swales that hold standing water most of the year 
(Fig. 1d, red dots). We found no pre-2004 sand beds in the quartz- 
sand soils of the ridges or in the slightly organic soils of swales that are 
merely damp. 

We traced pre-2004 beds across each of two marshy swales near a 
place where the 2004 tsunami reportedly flowed about 10m deep 


(Figs le and 2a, and Supplementary Fig. 3a). These swales formed 
about 2,500 years ago (bark and shell dates; Fig. 2b, c), when the 
area’s relative sea level was probably within 1-2 m of its present 
position’’. Because beaches have built the island westwards, the more 
westerly of the swales (X) postdates its neighbour (Y). We assembled 
stratigraphic cross-sections from correlated pits, from auger borings 
and from a trench 35m long, estimated particle size in the field, 
inferred a preliminary chronology from radiocarbon dating of indi- 
vidual plant remains and shells (Figs 2c and 3, and Supplementary 
Table 2), and made diatom analyses (Supplementary Figs 4 and 5). 

Peaty soil in swale X contains two sand sheets (B and C in Fig. 2b, 
d) that resemble the overlying 2004 deposit. Sheet C, the earlier, is 
commonly 10cm thick. Coarse to very coarse sand forms a discon- 
tinuous basal layer that fills pre-existing pockets in the underlying 
soil. The rest of sheet C consists of very fine sand and coarse silt that 
contains horizontal laminae defined most visibly by leaf fragments 
(Fig. 3c). The entire sheet formed after 2,200-—2,400 sidereal years ago, 
the age of an isolated piece of bark in the uppermost 1 cm of the 
underlying buried soil. Leaf fragments within the sheet gave ages that 
conflict with one another and that exceed the bark age by thousands 
of years (Fig. 3c). Sheet B, commonly 5 cm thick, typically fines 
upwards from fine sand to sandy silt. It conformably overlies peaty 
soil that contains a horizon of bark fragments in its uppermost 1 cm. 
Three of these fragments, collected separately, yielded ages between 
530 + 40 and 570 + 40 radiocarbon years before AD 1950 ('*C yr BP; 
Fig. 3b). If scarcely younger than these fragments, sheet B was depo- 
sited about 550-700 sidereal years ago (Supplementary Table 2). 

Three pre-2004 sand sheets alternate with peaty soil in swale Y 
(Fig. 2c, e). All three are similar in thickness to the overlying 2004 
sand sheet, and all extend preferentially up the swale’s seaward side. 
All were formed after the swale ceased holding an intertidal flat that is 
marked by non-abraided molluscan shells 2,500—2,800 sidereal years 
old (Fig. 2c and Supplementary Table 2). The lowest two sheets, 
otherwise undated and thus left uncorrelated with swale X, consist 
mainly of very fine to fine sand. They lack sedimentary structures, 
probably because of bioturbation that blurs their contacts with the 
soils beneath (Fig. 2e). The highest pre-2004 sheet (B) retains a sharp 
base and tabular shape that extend the full length of the trench 
(Fig. 2f). This sheet typically fines upwards from basal fine or med- 
ium sand to parallel-laminated very fine sand that abounds in leaf 
fragments (Fig. 3a). It probably correlates with sheet B of swale X 
because each is the youngest pre-2004 sand sheet in its swale and 
because the leaf fragments in swale Y yielded ages too young for 
correlation with sheet C (Fig. 3a, c). 

Although the 2004 sand sheet abounds in brackish and marine 
diatoms, the earlier sand sheets in swales X and Y lack diatoms of 
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any kind (Supplementary Figs 4 and 5). Marine and brackish-water 
diatoms aid in identifying tsunami deposits on temperate shores". 
Perhaps their opaline silica valves do not last long in tropical warmth; 
in experiments, the dissolution of diatoms increases with temper- 
ature’. 
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Preservation is also a problem for the sand sheets themselves. The 
pre-2004 sheets are distinct and sharply bounded where the swale soil 
is peaty, blurred by gradational contacts where the soil is just slightly 
organic, and totally absent in the sandy soils of beach-ridge crests. 
The 2004 tsunami deposit is already headed towards this fate: in wet 
swales it has a protective cap of organic matter as much as 5 cm thick, 
whereas on ridge crests it lacks any cover other than ejecta from 
burrows that tap the underlying sandy soil. Sheet B, if truly correl- 
ative between swales X and Y, initially spanned the intervening beach 
ridge for a total shore-normal length of no less than 100 m (Fig. le). 

Although sand sheets can record intense storms that drive waves 
over or through sandy beach berms’*, the geographic setting limits 
Phra Thong’s exposure to such storms. Less than 10° from the 
Equator, this part of Thailand fringes the belt where the Coriolis 
minimum limits cyclonic winds'’’. Scores of twentieth-century 
cyclones originated in Indian Ocean waters to its west, but all these 
moved towards India, Bangladesh or Myanmar™® without producing 
a known storm surge in Thailand’. Tropical cyclones do strike 
Thailand from its Pacific side. However, such a storm loses strength 
during its overland crossing to the Indian Ocean (an example is given 
in Supplementary Fig. 6), and its anticlockwise winds can pile the sea 
against Thailand’s west coast only in the storm’s trailing-left quad- 
rant. 

Phra Thong’s setting also disfavours sand-sheet deposition by river 
or wind. Tidal inlets separate the island from the nearest rivers 
(Fig. 1c). Aeolian dunes obscure little, if any, of the island’s delicate 
striping by beach ridges and swales (Fig. le and Supplementary Fig. 
1). 

Chronology provides three further reasons to ascribe the pre-2004 
sand sheets to tsunamis. First, the middle Holocene ages of the leaf 
fragments from sheet C (Fig. 3) imply scour into long-buried depos- 
its beneath tidal inlets. The 2004 tsunami showed capacity for such 
scour by knocking down mangroves along an inner part of the inlet 
that bounds Phra Thong Island on the south (at K in Fig. 1d). Second, 
the sand sheets represent infrequent events: the soil between sheets C 
and B spans 1,500-1,850 years, although it may contain the biotur- 
bated remains of an intervening sheet (Supplementary Fig. 2c); and 
the interval between sheet B and the 2004 tsunami lasted nearly 550- 
700 years (ranges computed from calibrated ages in Supplementary 
Table 2). These time intervals are in the broad range of deductive 
estimates for the recurrence of giant earthquakes in the Sumatra— 
Andaman source region of the 2004 tsunami* °. Third, sheet B, if little 
younger than AD 1300-1450 (Supplementary Table 2), may correlate 
with tsunami and earthquake evidence elsewhere. The youngest 
widespread pre-2004 sand sheet on a beach-ridge plain at 
Meulaboh, Sumatra (Fig. la, Me) overlies plant detritus dated to 
AD 1290-1400 (ref. 20). Two coral fragments on a marine terrace in 
the Andaman Islands gave ages in the range AD 1200-1650 (ref. 21). 
However, in accounts from Ibn Battuta (journey, ap 1325-1354)” 
and the great Ming armadas (voyages, AD 1405-1433)”, we found 
no written evidence for a sheet-B tsunami on Sumatran and Sri 
Lankan shores that the 2004 tsunami would overrun. 


Figure 1| Setting. a, Northern Sunda Trench and vicinity. Red lines show a 
modelled fault slip during the 2004 Sumatra—Andaman earthquake’ at 5 
(outer contour), 10, 15 and 20 m. Green lines show pre-2004 rupture 
areas~”*. Bathymetry is shaded at 1-km intervals. Me, Meulaboh; P, Phuket. 
b, Heights of the 2004 tsunami on the eastern rim of the Indian Ocean””””. 
The tallest bars for Thailand obscure dozens of height measurements below 
local maxima. c, Landforms of Phra Thong Island. White areas denote beach 
ridge plains (examples in e and Supplementary Fig. 1). The lines of angular 
discordance (repeated for reference in d) probably record shoreline retreat. 
H, Hornbill Hill’. d, Tsunami deposits on Phra Thong Island. K, knocked- 
down mangroves along southern inlet. The 2004 tsunami heights are 
maxima in metres above sea level (ref. 7 and Supplementary Fig. 1). e, Area 
of stratigraphic evidence shown in Figs 2 and 3. Post-tsunami image, 
probably taken early in 2005, from PointAsia.com. Trees and shrubs (dark 
green) delineate some of the swales. 
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Figure 2 | Cross-sectional shape of sand sheets. a, Topographic profile 
along the line in Fig. le. VE, vertical exaggeration. Maximum tsunami 
heights are from an eyewitness in the indicated tree (Fig. le and 
Supplementary Fig. 3a) and from a post-tsunami survey 1 km to the 
southwest (Fig. 1d and Supplementary Fig. 1b). b, c, Cross-sections in swales 
X (b) and Y (c) from inferred correlation between pits and auger borings 
(vertical grey lines). Soil peaty in swales (dark brown) and sandy on ridges 
(light brown). Ages are from Supplementary Table 2. d, e, Sand sheets 
alternate with dark peaty soils on the walls of pits in swales X (d) and Y 
(e). f, Lateral continuity of sand sheet B exposed in trench. 


What tsunami sources might Phra Thong’s pre-2004 sand sheets 
represent? Too little is known about the sheets’ landward extent on 
the island, let alone their potential correlates on other Indian Ocean 
shores, to require full-size predecessors to the 2004 Sumatra— 
Andaman earthquake. However, the sheets probably required rup- 
tures larger than that of 1881 (Fig. 1a); no sand sheet from the 1881 
tsunami, which crested less than 1 m high on Indian tide gauges”, is 
evident at Phra Thong Island in fibrous peaty soils that the 2004 
tsunami failed to incise while covering them with sand (Fig. 2d and 
Supplementary Fig. 3c—g). The pre-2004 sheets may also require 
Sunda Trench earthquakes larger than magnitude 8.5 if, as estimated 
from numerical simulations’, such earthquakes would spawn Thai 
tsunamis only a few metres high—barely high enough to invade Phra 
Thong’s beach-ridge plain (Fig. 2a). 

Sand sheets of Phra Thong Island thus forewarn of infrequent 
catastrophe. They are already providing public officials and coastal 
residents with tangible evidence that the 2004 tsunami was not the 
first of its kind (Supplementary Fig. 7). Still to be determined is 
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Figure 3 | Parallel laminae and radiocarbon ages. a, b, Indistinct horizontal 
laminae in sheet B are highlighted by leaf fragments. The two fragments 
dated yielded ages that differ from one another by about 600 radiocarbon 
years (a). The younger of these ages overlaps with ages on detrital tree-trunk 
bark in the highest part of the underlying soil (b). c, Leaf-fragments from 
laminae in sheet C yielded widely scattered ages thousands of years greater 
than that of bark in the underlying soil, all sampled within the same pit 
(Fig. 2d). 


whether centuries dependably separate such outsize tsunamis of 
Sumatra—Andaman source, and whether these recur often enough 
to dominate Thailand’s probabilistic tsunami hazard. Tsunamis 
without precedent in written history may threaten Indian Ocean 
shores that face other parts of the Sunda Trench and the Makran 
subduction zone****. It can be hoped that natural warnings from 
recent geological history will help avert surprises from these addi- 
tional tsunami sources. 


METHODS SUMMARY 


Landforms in Fig. 1c were traced from 1:50,000-scale aerial photographs taken in 
1999 and from post-tsunami satellite images at PointAsia.com and Google Earth. 
In Fig. 2a the levelling had a closure error of 10 cm, and the tide levels refer to the 
Khura Buri gauge, 16km northeast of Phra Thong. Diatom separation and 
analyses are described in Supplementary Figs 4 and 5. A separate suite of samples 
from sheet B yielded rare foraminiferal tests, all well preserved, that we do not 
interpret because of possible laboratory contamination with tests from the 2004 
deposit. The radiocarbon age ranges reported in sidereal years span the 95% 
confidence interval from counting and calibration statistics (Supplementary 
Table 2). 
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A 1,000-year sediment record of tsunami recurrence 


in northern Sumatra 


Katrin Monecke't, Willi Finger’, David Klarer®, Widjo Kongko“t, Brian G. McAdoo”, Andrew L. Moore® 


& Sam U. Sudrajat’” 


The Indian Ocean tsunami of 26 December 2004 reached max- 
imum wave heights of 35m in Aceh, the northernmost province 
of Sumatra’’. Both the tsunami and the associated Sumatra— 
Andaman earthquake were unprecedented in Acehnese history**. 
Here we use sand sheets to extend tsunami history 1,000 years into 
Aceh’s past. The 2004 tsunami deposited a sand sheet up to 1.8 km 
inland on a marshy beach ridge plain. Sediment cores from these 
coastal marshes revealed two older extensive sand sheets with simi- 
lar sediment characteristics. These sheets, deposited soon after 
AD 1290-1400 and ap 780-990, probably resulted from earlier 
tsunamis. An additional sand sheet of limited extent might correlate 
with a documented smaller tsunami of AD 1907. These findings, a 
first step towards a palaeotsunami record for northern Sumatra, 
suggest that damage-causing tsunamis in Aceh recur infrequently 
enough for entire human lifetimes to typically elapse between 
them. Such recurrence adds to the challenge of preparing com- 
munities along the northern Indian Ocean shorelines for future 
tsunamis. 

Aside from the 2004 tsunami and its successor in 2005, few tsuna- 
mis are documented in 400 years of Indonesian history to have 
reached or approached Aceh (Fig. la)**. The largest of these, in 
1907, devastated the west coast of Simeulue Island and extended 
more than 950 km along the northwestern coast of Sumatra*. Run- 
up heights in Aceh are uncertain but, as estimated from interviews 
with inhabitants, they were far from reaching the extreme wave 
heights of the 2004 tsunami (Supplementary Note 1). Previously, 
in 1861, Aceh apparently sustained little or no damage from a docu- 
mented west Sumatran tsunami‘ associated with a likely predecessor 
of the Nias earthquake (moment magnitude (M,,) = 8.7) in March 
2005 (ref. 7). The latter caused a tsunami that reached heights of 
1-4 m in Aceh*. 

We sought geological evidence for past tsunamis on a marshy plain 
along 4km of coast north of Meulaboh (Fig. 1a, Me). Marshes have 
previously yielded geological histories of tsunamis in the northwes- 
tern United States’, Kamchatka’, Japan'' and Chile’’. In Aceh, the 
marshy plain shows a characteristic ridge and swale topography that 
is formed by rapid shoreline progradation. Individual beach ridges, 
which mark former positions of the shoreline, run parallel to the 
coast for several kilometres (Fig. 1b and Supplementary Fig. 1). 
They are separated by swales where peaty marsh deposits accumulate. 

The sequence of ridges builds progressively seaward at a rate that 
can be determined from the ages of deposits on the beach ridge plain: 
the oldest deposit in a swale ~1,800m distant from the current 
coastline is dated at between AD 780 and 990, or between 1,230 and 


1,020 years ago (Fig. 2a, c). Assuming that at the time ofits deposition 
the coastline was located at the foot of the beach ridge immediately 
before it—that is, at a distance of ~1,600 m from the current coast- 
line—the average coastal progradation rate can be estimated to lie 
between 1.3 to 1.6 myr_'. Sucha range has been described for similar 
beach ridge plains elsewhere'*"*. 

In swales between beach ridges we took more than 100 auger cores 
along four transects running perpendicular from the coast up to 2 km 
inland (Fig. 1b and Supplementary Fig. 1). Wet ground and partly 
submerged areas limited access and precluded trenching 
(Supplementary Fig. 1). The coring in the swales was limited to 
depths of 1-2 m by compact grey-greenish, fine to medium shoreface 
sand extending beneath the beach ridge plain (Fig. 2 and 


Cross-section in Fig. 2: 
96° 


Observed extent of 
== 2004 deposit 
=== Unit A 

ee= Unit B 

=== Unit C 


2] Beach ridges 


e Core site 


Figure 1| Index maps. a, Northern Sunda margin and vicinity. The red 
patch denotes the area of seismic slip during the 2004 Sumatra—Andaman 
earthquake of magnitude 9.2 (ref. 26). The green patch indicates the area of 
seismic slip during the March 2005 earthquake’. Rectangular patches 
outlined in black mark inferred rupture areas of historically reported 
earthquakes**”*. The blue line denotes Acehnese coast damaged by the 2004 
tsunami. SI, Simeulue Island; Me, Meulaboh; PT, Phra Thong Island, 
Thailand. b, Locations of sample sites and extents of sand sheets A, B and C 
along shore-normal transects. The beach ridges, sketched from aerial 
photographs (Supplementary Fig. 1), mark former shorelines from the past 
1,000 years or more. The 2004 tsunami inundation is mapped from the 
extent of dead trees in Supplementary Fig. 1. 
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Supplementary Fig. 2). Because swale age increases inland, however, 
coring progressively farther from the modern beach allowed us to 
extend the sedimentary record about 1,000 years into the past. 
North of Meulaboh the 2004 tsunami reached wave heights of 
between 9 and 14m (ref. 15) and inundated the area to a distance 
of 2.0-2.5 km inland (Fig. 1b and Supplementary Fig. 1). The tsu- 
nami deposited a sand sheet that can be followed up to 1.8 km inland, 
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and shows overall landward thinning from ~50 cm near the shore- 
line to a few millimetres farther inland (Fig. 2 and Supplementary 
Fig. 2). A continuous sand sheet can be found in swales up to 800m 
inland, whereas farther away from the shoreline the deposit becomes 
patchy. Although typically without obvious internal structure, in 
some locations the 2004 sand sheet shows normal grading with as 
many as three normally graded layers (Supplementary Fig. 3). Near 
the present shore the 2004 deposit consists of moderately-to-well- 
sorted medium sand dominated by silicate grains. Farther inland the 
sorting becomes poor; the mean grain size diminishes to fine sand 
(Supplementary Fig. 4), and the sand grains are mixed with soil 
fragments and plant debris. Microscopic fossils in 2004 deposits 
include pollen grains and diatoms. Although the diatoms are mostly 
of freshwater origin, one sample abounds in Nitzschia salinicola, a 
species that can grow in brackish water and is associated with sand 
grains (Supplementary Fig. 2 and Supplementary Table 1). 

Sediment cores from the swales revealed three older sand beds, a 
few millimetres to 25 cm thick, that are intercalated with peaty marsh 
deposits (units A, B and C in Fig. 2, and Supplementary Fig. 2). We 
correlated them both between cores along shore-normal transects 
and between transects connected by shore-parallel beach ridges 
(Figs 1b and 2, and Supplementary Figs 1 and 4). Though nowhere 
encountered in vertical stratigraphic succession, the three units are 
superposed geomorphically: in keeping with the progressive seaward 
shift in shoreline, the oldest of the three units (C) is the farthest 
inland, and the youngest unit (A) the closest to the modern shoreline. 
The three units can be distinguished not just by geomorphic position 
on the beach ridge plain but also by landward fining along transects 
(Supplementary Fig. 4) and by radiocarbon ages (Supplementary 
Table 2). 

Among the three pre-2004 sand sheets, unit A was found in just a 
few places 300-500 m from the modern beach, whereas units B and C 
extend more widely. Along transect 1 we found them 700-1,100m 
inland and 1,200-1,800 m inland, respectively (Figs 1b and 2). All 
three units resemble the 2004 tsunami deposit compositionally: they 
are composed of grey-brown, silicate-rich, fine to medium sand rich 
in pollen grains and plant debris. Individual units become more fine- 
grained and less well sorted with increasing distance to the shoreline 
(Supplementary Fig. 4). Also like the 2004 deposit, the sand layers are 
typically massive but might show normal grading. Unit B contains 
two depositional units locally (Fig. 2d). 

Among 14 microfossil samples, only 8 yielded diatoms 
(Supplementary Fig. 2). These are dominated by freshwater species 
(Supplementary Fig. 2 and Supplementary Table 1) except in one 
sample containing marine valves of Actinoptychus and Diplomenora 
(Fig. 2e, Supplementary Fig. 2 and Supplementary Table 1). We infer 
that the overall scarcity of diatoms in the pre-2004 sand units results 
from the dissolution of diatom silica at high temperatures and within 
an organic-rich environment’. 

We obtained AMS radiocarbon ages on pieces of wood and other 
plant debris, all probably detrital, from below and above sand beds. 
We interpret the lower dates as maximum ages and the upper ones as 
minimum ages for the times of sand-sheet deposition 
(Supplementary Table 2). The resulting age estimates, with ranges 
at two standard deviations, are younger than 1640-1950 ap for 


Figure 2 | Examples of sand sheets near Meulaboh. a, Topography of the 
beach ridge plain along shore-normal transect 1 (location shown in Fig. 1b), 
obtained by auto-level surveying, field observations, aerial photographs and 
a digital elevation model from helicopter surveys”’. Cores are shown by 
vertical lines. The observed extents of the 2004 sand sheet and of three earlier 
sheets (units A, B and C) are plotted above profile. Stippled lines mark the 
extent of sand sheets projected from near shore transects 2 and 3. The 
vertical elevation has been magnified 125-fold. MSL, mean sea level. 

b, c, Generalized stratigraphy and radiocarbon ages (in years AD at two 
standard deviations) along two parts of transect 1 indicated in 

a. d-f, Photographs of auger core samples from sand units B and C. 
Locations are shown in b and c. 
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unit A, between 1290-1400 and 1510-1950ap for unit B, and 
between 780-990 and 1000-1170 ap for unit C. 

Units A, B and C are best explained by tsunamis that ran onto the 
beach ridge plain and deposited sand on top of peaty marshes. 
Architecturally the three units resemble tsunami sand sheets described 
elsewhere: the sand extends hundreds of metres inland and rarely 
exceeds 25cm in thickness'’. Massive or normally graded beds and 
the lack of bed-load transport structures further indicate settling from 
suspension, which is characteristic of tsunami deposition'”’*. Finer- 
grained and less well sorted deposits with increasing distance from the 
shoreline have been observed in other tsunami deposits'’ and point 
towards waning flow velocities and towards the incorporation of plant 
debris and soil into the flow during the landward passage of a tsunami. 
Despite the meagre microfossil evidence of a marine origin of the sand 
layers, we rule out flood deposition because most of our sampling sites 
are located a few kilometres south of the nearest (Woyla) river 
(Supplementary Fig. 1). Furthermore, villagers report that river floods 
during the monsoon season have not reached our sampling sites. 

Sumatra’s geographical setting probably excludes the deposition of 
the sand beds by storms. The island is located within the equatorial 
zone where tropical cyclones rarely occur for lack of sufficient Coriolis 
Force’. One exception was Typhoon Vamei in 2001, which formed in 
the South China Sea, proceeded westward and eventually passed over 
Sumatra, 100 km south of Meulaboh, before dissipating over the Indian 
Ocean”. The storm weakened during its passage over Sumatra and 
caused no reported storm surge on the island’s west coast”'. 

How might the three pre-2004 sand units correlate with the written 
and geological records in the Indian Ocean region? Unit A, being 
younger than aD 1640-1950 and limited to a narrow area, might 
represent the documented smaller tsunami of AD 1907. Unit B, if 
deposited shortly after AD 1290-1400, may correlate with the young- 
est pre-2004 tsunami deposit inferred from sand sheets of Phra 
Thong Island, Thailand (Fig. la), dated to soon after AD 1300-1450 
(ref. 22). Unit C, deposited after AD 780-990, has no dated equivalent 
in Thailand. However, marine terraces in the Andaman Islands, the 
northern part of the 2004 rupture area (Fig. 1a), have been dated to 
AD 1170-1600 and ap 550-1330 and have been interpreted as evid- 
ence for subduction earthquakes”. Acehnese units B and C might 
correlate with these terraces. 

The combined evidence from Meulaboh and Phra Thong, 
although correlated tentatively, suggests that recurrence intervals of 
destructive tsunamis from Sumatra—Andaman sources can span cen- 
turies, with the 2004 Indian Ocean tsunami separated from its young- 
est full predecessor by perhaps 600 years. Such a long recurrence 
interval, which exceeds Indonesian tsunami records by about 200 
years, would explain not just the lack of historical data but also the 
enormity of the 2004 Sumatra—Andaman earthquake™. 

Infrequent tsunami recurrence poses a dilemma for communities 
devastated by the 2004 tsunami. What balance should survivors seek 
between the risks of the next tsunami that might not return for a few 
generations, and the benefits of living close to sea? A partial solution 
is to ensure that communities sustain awareness of tsunami hazards. 
Such awareness saved thousands of lives on Simeulue Island (Fig. 1a), 
where people knew to interpret the severe ground shaking of the 2004 
earthquake as a natural tsunami warning”’. This knowledge, derived 
from Simeulue’s great losses to the 1907 tsunami, led the island’s 
residents flee to high ground in time to escape the 2004 tsunami. 
On the Sumatran mainland, a large fraction of the tsunami’s victims 
may have survived in this fashion had the area’s history of recurring 
tsunamis been detected and communicated in time. 


Received 7 March; accepted 27 August 2008. 


1. Tsuji, Y. et al. Distribution of the tsunami heights of the 2004 Sumatra tsunami in 
Banda Aceh measured by the tsunami survey team (http://www.eri.u-tokyo.ac.jp/ 
namegaya/sumatera/surveylog/eindex.htm) (2005). 

2. Yalciner, A. C. et al. December 26, 2004 Indian Ocean tsunami field survey (Jan. 
21-31, 2005) at north of Sumatra Island (http://ioc.unesco.org/iosurveys/ 
Indonesia/yalciner/yalciner.htm) (2005). 


1234 


NATURE|Vol 455|30 October 2008 


3. Hamzah, L., Puspito, N. & Imamura, F. Tsunami catalogue and zones in Indonesia. 
J. Nat. Disast. Sci. 22, 25-43 (2000). 

4. Newcomb, K. R. & McCann, W. R. Seismic history and seismotectonics of the 
Sunda Arc. J. Geophys. Res. 92, 421-439 (1987). 

5. Bilham, R., Engdahl, R., Feldl, N. & Satyabala, S. P. Partial and complete rupture of 
the Indo-Andaman plate boundary 1847-2004. Seismol. Res. Lett. 76, 299-311 
(2005). 

6. Ortiz, M. & Bilham, R. Source area and rupture parameters of the 31 December 
1881 My = 7.9 Car Nicobar earthquake estimated from tsunamis recorded in the 
Bay of Bengal. J. Geophys. Res. 108, 1-16 (2003). 

7. Briggs, R. W. et al. Deformation and slip along the Sunda megathrust in the great 
2005 Nias-Simeulue earthquake. Science 31, 1897-1901 (2006). 

8. U.S. Geological Survey. Notes From the Field ... USGS Scientists in Sumatra Studying 
Recent Tsunamis: Leg 2 Reports, 12 April to 30 April 2005 <http:// 
walrus.wr.usgs.gov/news/reports.html> (2005). 

9. Atwater, B. F. & Moore, A. L. A tsunami about 1000 years ago in Puget Sound, 

Washington. Science 258, 1614-1617 (1992). 

O. Pinegina, T. K. & Bourgeois, J. Historical and paleo-tsunami deposits on 
Kamchatka, Russia: long-term chronologies and long-distance correlations. Nat. 
Hazards Earth Syst. Sci. 1, 177-185 (2001). 

1. Nanayama, F. et al. Unusually large earthquakes inferred from tsunami deposits 
along the Kuril trench. Nature 424, 660-663 (2003). 

2. Cisternas, M. et al. Predecessors of the giant 1960 Chile earthquake. Nature 437, 
404-407 (2005). 

3. Anthony, E. J. Beach-ridge development and sediment supply: examples from 
West Africa. Mar. Geol. 129, 175-186 (1995). 

4. Tanner, W. F. Origin of beach ridges and swales. Mar. Geol. 129, 149-161 (1995). 

5. Kongko, W., Istiyanto, D. C. & Irwandi, |. Tsunami Modeling and Field Observations 

of December 26 2004 Indian Ocean Earthquake (Technical Report, Coastal 

Dynamic Research Center (BPPT), Jogjakarta, 2006). 

6. Bennett, P. C., Siegel, D. |., Hill, B. M. & Glaser, P. H. Fate of silicate minerals in a 

peat bog. Geology 19, 328-331 (1991). 

7. Morton, R. A., Gelfenbaum, G. & Jaffe, B. E. Physiacal criteria for distinguishing 

sandy tsunami and storm deposits using modern examples. Sedim. Geol. 200, 

84-207 (2007). 

8. Goff, J., McFadgen, B. G. & Chagué-Goff, C. Sedimentary differences between the 

Easter storm and the 15th century Okoropunga tsunami, southeastern North 
Island, New Zealand. Mar. Geol. 204, 235-250 (2004). 

9. Anthes, R. A. Tropical Cyclones: Their Evolution, Structure and Effects (Am. 
Meteorol. Soc., 1982). 

20. Chang, C.-P., Liu, C.-H. & Kuo, H.-C. Typhoon Vamei: An equatorial tropical 
cyclone formation. Geophys. Res. Lett. 30, 1151-1154, doi:10.1029/2002GL016365 
(2003). 

21. Grandau, F. & Engel, G. 2007 Annual Tropical Cyclone Report. (U.S. Naval Pacific 
Meteorology and Oceanography Center/Joint Typhoon Warning Center, 2002). 

22. Jankaew, K. et al. Medieval forewarning of the 2004 Indian Ocean tsunami in 
Thailand. Nature doi:10.1038/natureO07373 (this issue). 

23. Rajendran, K. et al. Age estimates of coastal terraces in the Andaman and Nicobar 
islands and their tectonic implications. Tectonophysics 455, 53-60 (2008). 

24. Stein, S. & Okal, E. A. Ultralong period seismic study of the December 2004 Indian 
Ocean earthquake and implications for regional tectonics and the subduction 
process; The 2004 Sumatra-Andaman earthquake and the Indian Ocean 
tsunami. Bull. Seismol. Soc. Am. 97, 279-295 (2007). 

25. McAdoo, B. G., Dengler, L., Prasetya, G. & Titov, V. Smong: How an oral history 
saved thousands on Indonesia's Simeulue Island during the December 2004 and 
March 2005 tsunamis. Earthg. Spectra 22, 661-669 (2006). 

26. Chlieh, M. et al. Coseismic slip and afterslip of the great My, 9.15 
Sumatra-Andaman earthquake of 2004. Bull. Seismol. Soc. Am. 97, 152-173 
(2007). 

27. Siemon, B., Steuer, A., Voss, W. & Rehli, H. J. Helicopter-Borne Geophysical 
Investigation in The Province of Nanggroe Aceh Darussalam, Northern Sumatra, 
Indonesia. Survey area Calang-Meulaboh (Technical Report, Federal Institute for 
Geosciences and Natural Resources (BGR), Hanover, 2006). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank the field offices of Caritas Switzerland and the 
Catholic Relief Services (CRS) in Meulaboh for logistical support during fieldwork; 
D. Pléthner and U. Meyer for providing maps and digital elevation data for the study 
area; the SIM data centre of the Aceh and Nias Rehabilitation and Reconstruction 
Board (BRR), in Banda Aceh, for providing aerial photographs; T. Ninck for help in 
the field; J. Beitel for assistance with laboratory work; K. Moran, S. Lithi, S. Phipps, 
A. Smith, N. Wells and J. Ortiz for discussions; and B. Atwater, S. Bondevik, Y. Sawai 
and K. Sieh for reviews. The work was funded by the Swiss Science Foundation, the 
US National Science Foundation, and Vassar College. 


Author Contributions K.M., W.F., W.K. and S.S. did the field work. D.K. performed 
the diatom analyses. A.M. and K.M. conducted the grain-size analyses. W.F. and 
B.M. did the mapping. K.M. prepared the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. Correspondence and requests for materials should be 
addressed to K.M. (monecke@pitt.edu). 


©2008 Macmillan Publishers Limited. All rights reserved 


Vol 455|30 October 2008|doi:10.1038/nature07281 


nature 


LETTERS 


Agrochemicals increase trematode infections in a 
declining amphibian species 


Jason R. Rohr’, Anna M. Schotthoefer®, Thomas R. Raffel”, Hunter J. Carrick*, Neal Halstead!, 
Jason T. Hoverman’, Catherine M. Johnson®, Lucinda B. Johnson®, Camilla Lieske*, Marvin D. Piwoni’, 


Patrick K. Schoff® & Val R. Beasley” 


Global amphibian declines have often been attributed to disease’”’, 
but ignorance of the relative importance and mode of action of 
potential drivers of infection has made it difficult to develop effec- 
tive remediation. In a field study, here we show that the widely 
used herbicide, atrazine, was the best predictor (out of more than 
240 plausible candidates) of the abundance of larval trematodes 
(parasitic flatworms) in the declining northern leopard frog Rana 
pipiens. The effects of atrazine were consistent across trematode 
taxa. The combination of atrazine and phosphate—principal agro- 
chemicals in global corn and sorghum production—accounted for 
74% of the variation in the abundance of these often debilitating 
larval trematodes (atrazine alone accounted for 51%). Analysis of 
field data supported a causal mechanism whereby both agroche- 
micals increase exposure and susceptibility to larval trematodes by 
augmenting snail intermediate hosts and suppressing amphibian 
immunity. A mesocosm experiment demonstrated that, relative to 
control tanks, atrazine tanks had immunosuppressed tadpoles, 
had significantly more attached algae and snails, and had tadpoles 
with elevated trematode loads, further supporting a causal rela- 
tionship between atrazine and elevated trematode infections in 
amphibians. These results raise concerns about the role of atrazine 
and phosphate in amphibian declines, and illustrate the value of 
quantifying the relative importance of several possible drivers of 
disease risk while determining the mechanisms by which they 
facilitate disease emergence. 

There is growing appreciation of the importance of infectious 
diseases in driving population dynamics and declines of wildlife spe- 
cies**. However, investigators rarely assess the relative importance of 
the multitude of potential drivers and demonstrate a causal relation- 
ship between these factors and emerging infections so that the most 
problematic driver(s) could be effectively managed to prevent dis- 
eases of conservation concern’. Identifying the main risk factors for 
disease in amphibians is especially important because their wide- 
spread population declines have often been attributed to infectious 
disease’”. Trematode infections of amphibians have attracted 
research and public attention predominantly because they can cause 
grotesque limb malformations, kidney damage and debility®’. 
Furthermore, certain trematode infections are considered to be 
emerging diseases of amphibians driven by anthropogenic factors”"°, 
and many can be lethal with the probability of amphibian mortality 
increasing with trematode load®'?”. 

To understand the relative importance of drivers of amphibian 
trematode infections, we studied 18 wetlands in Minnesota, USA 
(Supplementary Table 1 and Supplementary Fig. 1), in which we 


measured more than 240 plausible predictors of amphibian trem- 
atode infections (Supplementary Table 2). We quantified larval trem- 
atode abundance in northern leopard frogs (R._ pipiens; 
Supplementary Fig. 2), a species in decline across much of its range’, 
and determined which of the factors best predicted the abundance of 
these parasites in these amphibians. The potential predictors we 
quantified included (1) the abundance of intermediate hosts of tre- 
matodes; (2) water quality and pollutants (Supplementary Table 3), 
which might suppress host immunity or be directly lethal to hosts or 
free-living stages of trematodes"; and (3) landscape attributes that 
might be crucial for supporting amphibians and avian and mam- 
malian definitive hosts. 

Our analyses showed that atrazine, the second most commonly 
used pesticide in the United States’ and perhaps the world, com- 
bined with one of its metabolites, desethylatrazine (both from water 
samples), was the best predictor of total larval trematode abundance 
in R. pipiens. The sum of atrazine and desethylatrazine accounted for 
more variation than any other single predictor (51%, Monte Carlo 
randomization test, P = 0.001, Fig. la and Supplementary Fig. 3) and 
occurred in more models than any other predictor in a best subset 
analysis (Supplementary Table 4). For subsequent analyses, we 
focused on total larval trematode abundance because of the similar 
responses of larval trematode taxa to atrazine (Supplementary Table 
5), and on atrazine and desethylatrazine combined because the com- 
bination was a better predictor of larval trematode loads than either 
variable alone and because the two compounds have similar half- 
lives and toxicity'®. Hereafter, for brevity, we refer to the sum of 
detectable atrazine and desethylatrazine concentrations in water as 
‘atrazine’. 

We proposed that atrazine augmented the richness and abundance 
of gastropods, the first intermediate host of trematodes, increasing 
the abundance of free-living trematode cercariae and the consequent 
exposure of R. pipiens to trematodes (Supplementary Fig. 4)'°. This 
hypothesis was on the basis of the observation that atrazine and other 
herbicides regularly cause net increases in periphytic algae’’, a food 
source for many gastropods, despite herbicides being directly toxic to 
many periphyton species (see Supplementary Methods for mechan- 
isms). In addition, we also suggested that atrazine increased the 
susceptibility of R. pipiens to trematode infections (Supplementary 
Fig. 4). This hypothesis was on the basis of laboratory studies which 
showed that low atrazine concentrations caused immunosuppression 
in ranid frog species’*!” and an ambystomatid salamander”, increas- 
ing larval trematode loads and susceptibility to viral infections, 
respectively’?”?. 


‘Biology Department, University of South Florida, Tampa, Florida 33620, USA. @Penn State Center for Infectious Disease Dynamics, Penn State University, University Park, 
Pennsylvania 16802, USA. *College of Veterinary Medicine, University of Illinois at Urbana-Champaign, Urbana, Illinois 61802, USA. *School of Forest Resources, Penn State 
University, University Park, Pennsylvania 16802, USA. Department of Forest, Wildlife and Fisheries, The University of Tennessee, Knoxville, Tennessee 37996-4563, USA. °Natural 
Resources Research Institute, University of Minnesota Duluth, Duluth, Minnesota 55811, USA. “Illinois Waste Management and Research Center, Champaign, Illinois 61820, USA. 
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We used a path analysis to explore the amount of support for 
these proposed mechanisms for how atrazine might increase larval 
trematode loads”’. Despite the low sample size of the path analysis, 
almost every path was significant (Fig. 1b) and the parameters were 
generally robust to bootstrapping. Parameter significance was also 
unaffected when controlling for the relationship between atrazine 
and phosphate (Supplementary Table 6), the latter of which is a 
primary ingredient in fertilizers that can also increase larval trem- 
atode loads’*. The path model showed that atrazine was positively 
related to gastropod abundance and richness, and that gastropod 
richness was positively associated with larval trematode loads 
(Fig. 1b). Independent of this exposure-mediated effect on infection 
risk was a significant positive association between atrazine and larval 
trematode abundance (Fig. 1b), consistent with atrazine increasing 
the susceptibility of R. pipiens to trematodes. 

To test this susceptibility hypothesis, we quantified melanoma- 
crophage aggregates in the livers of frogs from each wetland 
(Supplementary Fig. 5). Melanomacrophages are cells of macrophage 
lineage (that is, Kupffer cells) found in the kidneys, spleen and liver 
that seem to be involved in inflammatory responses, in responding to 
foreign and dead material, and in fighting infections by a variety of 
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Figure 1| Proposed mechanisms for the relationship between atrazine and 
larval trematodes in amphibians. a, The relationship between detectable 
concentrations of atrazine plus desethylatrazine in water samples and larval 
trematode abundance (‘probit’ transformed, that is, the normal standard 
deviate of mean abundance/1,000) in northern leopard frogs, R. pipiens. 
Each data point represents a wetland, and a 95% confidence band is shown. 
b, Results of a path analysis examining the suggested mechanisms by which 
atrazine plus desethlyatrazine increase larval trematode abundance. 
Unstandardized and standardized coefficients, respectively, are provided 
next to each path. Solid arrows represent significant paths. The model chi- 
square was 0.339 (d.f. = 1, P = 0.561), indicating that the model was a good 
fit to the data. 
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parasites including encysted larval helminths***; furthermore, they 
have been proposed as reliable biomarkers for water pollution”. We 
did not incorporate melanomacrophage aggregates into the path 
analysis because, unlike the other variables in the model (Fig. 1b), 
melanomacrophage aggregates seemed to be significantly increased 
by the richness and abundance of nematodes and adult trematodes 
that also infected the frogs (Supplementary Table 7). Consequently, 
we controlled for these co-infections when testing for relationships 
between melanomacrophage aggregates and atrazine, and between 
melanomacrophage aggregates and larval trematode loads. 
Hereafter, nematode- and adult trematode-independent melanoma- 
crophage aggregate levels will be referred to as melanomacrophage 
aggregate scores. 

We discovered that atrazine was a significant negative predictor of 
melanomacrophage aggregate scores in R. pipiens (Fig. 2a). 
Phosphate (in water samples) was also a significant negative pre- 
dictor of melanomacrophage aggregate scores (Fig. 2b). Similarly, 
in a laboratory study, both atrazine and fertilizer decreased peri- 
pheral leukocyte concentrations in tiger salamanders”. Moreover, 
the melanomacrophage aggregate score was a significant negative 
predictor of larval trematode abundance in R. pipiens (Fig. 2c), sup- 
porting the assertion that pollution-related immunosuppression was 
an important contributor to elevated larval trematode loads. 

We found little evidence that atrazine was correlated with any 
other quantified variables that could parsimoniously explain the 
observed patterns; no variables that were correlated with atrazine 
and larval trematode loads were correlated with gastropod richness 
and melanomacrophage aggregate scores (Supplementary Table 8). 
Specifically, water column phosphate—which also can elevate 
amphibian trematode loads by increasing periphytic algae and gast- 
ropod abundance'’—was not significantly correlated with atrazine 
and, when tested alone, was not a significant predictor of the 
abundance of larval trematodes in R. pipiens (Supplementary 
Table 8). However, when we included phosphate and atrazine in 
our statistical model, both variables were significant positive pre- 
dictors of larval trematode burdens in R. pipiens (F,,\4 = 12.17, 
P=0.004, and F,14= 31.06, P<0.001, respectively), indicating 
that they accounted for relatively uncorrelated components of vari- 
ation in larval trematode loads in the study region. Moreover, the 
combination of atrazine and phosphate accounted for 74% of the 
variation in larval trematode abundance (F,14 = 19.75, P< 0.001), 
and atrazine and phosphate were in 19 and 17 of the top 20 best 
subset models for larval trematode abundance, respectively 
(Supplementary Table 4). 

We conducted a community-level mesocosm experiment to test 
whether the detected correlation between atrazine and amphibian 
trematode loads was indeed causal. Our mesocosm experiment 
showed that, in just one generation, cattle tanks with a single dose 
of atrazine at an expected environmental concentration had more 
than four times as many snails as control tanks (Table 1). Atrazine 
significantly reduced phytoplankton abundance, which resulted in 
significantly increased nutrient availability, water clarity and sunlight 
penetration to attached algae (Table 1). Tanks with atrazine had 
significantly greater periphyton levels than control tanks (Table 1), 
which presumably fuelled the elevated snail reproduction and per- 
haps increased snail richness in ponds with elevated atrazine (due to 
productivity—diversity relationships; Supplementary Methods). 

As in the field survey, there was a significant association between 
atrazine exposure and elevated frog susceptibility in our mesocosm 
experiment. The mesocosm experiment was conducted in the south- 
ern range of R. pipiens where this species is declining’’. Consequently, 
we conducted our mesocosm experiment on two congeners, Rana 
palustris (pickerel frog) and Rana clamitans (green frog). Rana palus- 
tris had significantly lower liver melanomacrophage counts in atra- 
zine tanks compared to control tanks (Table 1), and a similar trend 
was observed for their liver eosinophils (Table 1), another important 
immune cell for fighting larval trematodes'’’. Rana palustris survival 
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Figure 2 | Relationships between melanomacrophage aggregates and 
atrazine, phosphate and larval trematode loads. a, The relationship 
between melanomacrophage aggregate score in R. pipiens and 
concentrations of atrazine plus desethylatrazine in water samples. b, The 
relationship between melanomacrophage aggregate score in R. pipiens and 
phosphate concentration in water samples. c, The relationship between 
melanomacrophage aggregate score in R. pipiens and larval trematode 
abundance. In these graphs and analyses, we used the residuals from the 
relationship among mean melanomacrophage aggregate scores and the 
richness and abundance of nematodes and adult trematodes in leopard frogs 
(see text). Each data point represents a wetland, and 95% confidence bands 
are shown. 
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was significantly lower in the tanks with atrazine than in the tanks 
without atrazine (Table 1), preventing us from conducting an uncon- 
founded test of the effect of atrazine on trematode loads in this 
species because it is probable that the most highly infected indivi- 
duals were those that died. 

For R. clamitans, atrazine exposure was not significantly associated 
with reduced liver melanomacrophages or elevated mortality, but it 
was associated with significant reductions in liver eosinophil counts 
(Table 1). Hence, in both ranid species, atrazine significantly sup- 
pressed immune parameters known to be important for fighting 
larval trematodes'”**, a result consistent with previously documen- 
ted immunosuppression in R. pipiens and Rana sylvatica exposed to 
atrazine'*'’. Furthermore, in atrazine tanks, R. clamitans had signifi- 
cantly more plagiorchid trematodes than in control tanks when con- 
trolling for the relationship between plagiorchid loads and 
eosinophils (Table 1), suggesting that atrazine had effects on trem- 
atode infections that were independent of its effects on eosinophils 
(Supplementary Methods). 

Larval trematode loads in amphibians must depend on the 
abundance of definitive hosts, such as amphibian-eating birds, 
because they are the source of trematode eggs in wetlands; thus, 
an important caveat is that atrazine and phosphates cannot increase 
trematode infections unless there is an ample supply of trematodes 
to begin with. This dependence is supported by the significant 
interaction between atrazine and the number of habitat patches 
which were ‘suitable’ for definitive hosts (forest, wetlands and open 
water) around each wetland (Monte Carlo randomization test: 1 km 
radius, P= 0.004; 10km radius, P= 0.019), the latter of which is a 
proxy for visitations by, and for the abundance of, definitive hosts. 
Knowledge of this context-dependency should facilitate appropri- 
ately targeting remediation strategies for amphibian trematode 
infections. 

Our findings suggest that atrazine, at concentrations commonly 
occurring in freshwater ecosystems, can be a primary driver of larval 
trematode infections for a declining amphibian species, and that 
phosphate-associated eutrophication has an important, and perhaps 
complementary, role (because it was only significant when atrazine 
was included as a covariate). Both atrazine and phosphate appear to 
increase exposure and susceptibility to larval trematodes by aug- 
menting snail intermediate hosts'® and by suppressing amphibian 
immune responses. By evaluating more than 240 alternative hypo- 
theses, a number which could never realistically be examined in a 
manipulative experiment, and by demonstrating consistent results 
and underlying mechanisms between our controlled mesocosm 
experiment and wetland survey, we have substantially reduced the 
probability that the relationships identified are spurious. Hence, we 
provide support for the hypothesis that reducing atrazine and phos- 
phate inputs to wetlands might effectively remediate elevated and 
often debilitating amphibian trematode infections. 

Notably, the potentially important nexus between amphibian 
parasitism and pollution demonstrated here would not have been 
detected in standard studies used to register chemicals in the United 
States and Europe because these studies are typically conducted on 
individuals isolated from other species with which they naturally 
coexist, such as their parasites**’®. Therefore, greater effort is needed 
to understand the potentially important effects of pollution on spe- 
cies interactions that may contribute to reduced host survival’*”*. 
Additionally, the results of this study—along with other studies link- 
ing pesticide use with amphibian declines” and linking low levels of 
atrazine with endocrine disruption, hermaphroditism” and mortal- 
ity in amphibians”*’—raise serious concerns about the relationship 
between atrazine and global amphibian losses. Altogether, our work 
demonstrates the value of quantifying the relative importance of 
several plausible drivers of disease risk and population declines using 
a combination of field surveys and manipulative studies to enable 
rational and prioritized environmental remediation. 
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Table 1| Findings from mesocosm experiment examining the effect of atrazine on factors proposed to influence larval trematode loads in frogs. 


Response variable Atrazine (n = 4) Control (n = 8)* Test statistic Py 
Mean s.e.m. Mean s.e.m. 

Snail hatchlings m? 468.09 205.85 110.64 29.31 20.19% <0.001 
Snail egg masses m 7 30.45 3.03 12.37 4.34 4.98t 0.026 
Chlorophyll a in phytoplankton (ug 17+) 2.52 0.45 7.14 1.67 4.888 0.031 
Water clarity (scale 1-5) 4.75 0.25 3.88 0.23 5.2597 0.011 
Chlorophyll a in periphyton (mgm 7) 1.04 0.18 0.70 0.08 7.178 0.032 
Melanomacrophages per field of view for R. palustris (log) 0.16 0.06 0.36 0.06 4.508 0.031 
Eosinophils per field of view for R. palustris (log) 0.03 0.14 0.21 0.06 888 0.102] |# 
Survival of R. palustris (%) 47.50 7.50 70.00 3.90 9775 0.017 
Melanopmacrophages per field of view for R. clamitans (log) 0.25 0.13 0.38 0.09 0.678 0.217] |* 
Eosinophils per field of view for R. clamitans (log) 0.11 0.10 0.39 0.07 4.758 0.029 
Survival of R. clamitans (%) 76.25 3.15 76.25 2.80 <0.0018 O:971** 
No. of plagiorchid trematode cysts in R. clamitans 20.61 4.52 7.51 3.30 4.908 0.031] | ++ 
* The solvent and water controls were pooled because there was no significant difference between these treatments. 
+ Block was included in the model if its P value was less than 0.25. 
£77 statistic from analyses using the generalized linear model with a Poisson error distribution and a log link. 
8 F statistic from analyses using the general linear model. 


| | One-tailed test. 


4° statistic from analyses using the generalized linear model with an ordinal multinomial error distribution and a logit link. 
#The power of the test was 0.646; the minimum sample size necessary per group to detect a significant difference given the power was six. 


* The power of the test was 0.326. 
** The power of test was 0.056. 


++ Controlling for biomass of plagiorchid-infected snails (F;,7 = 5.45, P = 0.026) and mean log eosinophil counts in tadpoles (F;,7 = 3.58, P = 0.050). 


METHODS SUMMARY 


In each wetland in 1999, we characterized biological diversity (three visits: 
March-April, May—June and July-August; Supplementary Tables 2 and 3) and 
obtained water, sediment and amphibian tissue samples to quantify various 
analytes (two visits: April-May and June-July; Supplementary Table 3). 
Analyte quantification was conducted by the Illinois Waste Management and 
Research Center using standard US Environmental Protection Agency protocols. 
During wetland visits, we attempted to collect 15 recently metamorphosed 
R. pipiens for parasite quantification and 25 for pathology. 

Our mesocosm experiment was conducted at Pennsylvania State University 
using a randomized block design and twelve 1,100-litre plastic cattle tanks (8001 
of water) covered with shade cloth. Three weeks before dosing, each tank 
received 300 g of mixed hardwood leaves, plexiglass to sample snail egg masses 
and hatchlings, periphyton samplers and inoculations of zooplankton, peri- 
phyton and phytoplankton from four ponds. Just before dosing, each tank 
received four larval amphibian (5 Ambystoma maculatum, 20 Hyla versicolour, 
20 R. palustris and 20 R. clamitans), two snail (11 Planorbella trivolvis and 10 
Physa gyrina), one beetle (5 Hydrochara sp.), two water bug (2 Belostoma flumi- 
neum and 7 Notonecta undulate) and one dragonfly species (2 Anax junius). We 
applied a single dose of technical grade atrazine at 102 gl’, an expected envir- 
onmental concentration (mean actual concentration was 117 ug 1”!). The 
experiment ran for 4 weeks, from the end of June to July 2007. 

All variables were quantified blind to the wetland characteristics or mesocosm 
treatment. Wetlands and mesocosms were used as the sample units for statistical 
analyses. That is, if several measurements were taken on a wetland or a tank, 
statistical analyses were conducted on the mean of those measurements. The 
path analysis was conducted assuming linear relationships among the raw data 
and using maximum likelihood estimation and bootstrapping”'. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Wetland survey. Quantification of faunal diversity (amphibians and macroin- 
vertebrates) was conducted using daytime visual time-constrained and area- 
constrained dip-net sampling methods, adjusting effort according to the size 
of the wetland. Three 10-m transects were randomly placed within each com- 
munity type in and around each wetland, and the line-intercept method was used 
to record the relative cover of each plant species under or over the line. 

Landscape structure—including landscape composition, fragmentation pat- 
terns, patch density and connectivity patterns—was quantified around each 
wetland using Fragstats (version 2). Land cover data were derived from the 
1990s National Land Cover Database, which was based primarily on 1992 
Landsat-5 Thematic Mapper data, with a 30-m resolution. These data were used 
to summarize landscapes at 1-km and 10-km extents. The 1-km extent was 
chosen because it represents the approximate dispersal capabilities of leopard 
frogs, and it allowed for exploration of effects of adjacent land use on the parasite 
communities. The 10-km scale was examined because this larger spatial scale 
might affect the highly mobile definitive hosts of many frog parasites (for 
example, birds and mammals). 

The buccal and abdominal cavities were examined, and all visceral organs, skin 
and eyes were removed from necropsied frogs and inspected for parasites. After 
necropsy, the remaining frog carcasses were fixed in 10% neutral buffered form- 
alin and cleared and stained. Encysted metacercariae in the musculature were 
identified and enumerated by examining the cleared and stained specimens 
under a dissecting microscope (Supplementary Fig. 2). 

Frog livers were embedded in paraffin, stained with haematoxylin and eosin, 
sectioned and then examined under a light microscope. Melanomacrophage 
aggregate numbers were scored from zero to five, in which zero indicated that 
no melanomacrophage aggregates were present and five indicated that more 
than 90% of the liver was filled with melanomacrophage aggregates 
(Supplementary Fig. 5). Livers that fell in between categories were assigned 
intermediate values. All quantification was done on frogs of Gosner stage 46. 

Our statistical analysis for the wetland survey was done in four steps. In the 
first step, we calculated the coefficient of determination (R’) for each predictor in 
the absence of covariates to determine which single predictor accounted for the 
most variation in larval trematode abundance. In our second step, we used the 
general linear model to generate the 20 best subset models for mean larval 
trematode loads in R. pipiens. Models were ranked by adjusted R’ values and 
were limited to a maximum of three predictors (Supplementary Table 4). In our 
third step, we used a structural equation model, which was later reduced to a path 
analysis on the basis of a confirmatory factor analysis (see Supplementary 
Methods and Supplementary Fig. 6), to evaluate the amount of support for 
the specific proposed causal mechanisms underlying the significant positive 
relationship between atrazine concentration and larval trematode abundance 
(Fig. 1b). Phosphate was not included in our path model because the primary 
emphasis was on atrazine and our sample size was too small to include both 
chemicals in the model. Path analysis can be sensitive to relatively small samples 
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sizes, raising concerns about the precision of parameter estimates and the 
stability of our path model. To evaluate the robustness of the parameter esti- 
mates and model stability, we obtained standard error estimates for all the paths 
in Fig. 1b using both maximum likelihood and bootstrapping methods (200 
resamplings)*'. Another matter is that herbicides and fertilizers might positively 
co-vary in the landscape, raising concerns about the validity of the path model 
if the relationship between phosphate concentration and atrazine was not 
controlled. We re-calculated all the parameter estimates and standard errors 
controlling for the relationship between atrazine and phosphate concentrations. 
Structural equation modelling and path analysis and bootstrapping were done 
using SEPATH in Statistica 6.1 (StatSoft Inc.). The final step was to assess 
whether atrazine was correlated with any other variables that might more parsi- 
moniously explain the observed patterns. We used correlation analysis to 
identify any variables that were positively correlated with atrazine, larval trem- 
atode loads, gastropod richness and melanomacrophage aggregate scores. 

Mesocosm experiment. All of the species in the mesocosm experiment naturally 
coexist and were applied at densities found locally. The applied expected envir- 
onmental concentration of atrazine was calculated using the US Environmental 
Protection Agency’s GENEEC (version 2) software (Supplementary Methods). 

All snails used in the experiment were screened twice for patent (shedding) 
trematode infections before being placed in the tanks. In addition to the 11 
screened planorbid snails that were free to move in each tank, each tank also 
received a caged planorbid snail shedding cercariae from the family 
Plagiorchidae. These caged snails were rotated among the tanks every other 
day in an attempt to homogenize exposure to trematodes. Snails were thor- 
oughly rinsed before being moved to another tank and the cages were not 
rotated. 

At the end of the experiment, we took water samples for phytoplankton 
analyses, scraped periphyton from the periphyton samplers, scored water clarity 
ona scale from | to 5, and counted the snail egg masses on the walls of each tank, 
the snail hatchlings from the plexiglass placed in each tank and the surviving and 
dead animals. Chlorophyll a concentrations were measured using standard 
fluorometric techniques. Five preserved R. clamitans and five R. palustris 
tadpoles from each tank were selected randomly and melanomacrophage and 
eosinophils cells were counted per field of view from their sectioned and stained 
livers. Each tadpole was also cleared and stained and their larval trematodes were 
counted under a compound scope, as described previously. 

We re-screened each snail from trematodes at the end of the experiment and 
discovered that some of the snails that were placed in the tanks were pre-patent 
(infected but not shedding treamtodes) and began shedding plagiorchid cercar- 
iae during the experiment. Consequently, all snails were weighed and dissected to 
ensure that we did not miss any infected snails that were not shedding at the end 
of the experiment. One tank had a snail infected with strigeid trematodes (deter- 
mined on the basis of snail dissection and shedding) and we could not discrim- 
inate these trematode metacercariae from the plagiorchid metacercariae, and 
thus, this tank had to be dropped from the trematode load analyses. 
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Small-amplitude cycles emerge from stage- 
structured interactions in Daphnia-algal systems 


Edward McCauley’, William A. Nelson* & Roger M. Nisbet? 


A long-standing issue in ecology is reconciling the apparent 
stability of many populations with robust predictions of large- 
amplitude population cycles from general theory on consumer— 
resource interactions’. Even when consumers are decoupled from 
dynamic resources, large-amplitude cycles can theoretically emerge 
from delayed feedback processes found in many consumers”*. Here 
we show that resource-dependent mortality and a dynamic devel- 
opmental delay in consumers produces a new type of small-ampli- 
tude cycle that coexists with large-amplitude fluctuations in 
coupled consumer-resource systems. A distinctive characteristic 
of the small-amplitude cycles is slow juvenile development for con- 
sumers, leading to a developmental delay that is longer than the 
cycle period. By contrast, the period exceeds the delay in large- 
amplitude cycles. These theoretical predictions may explain pre- 
vious empirical results on coexisting attractors found in Daphnia-— 
algal systems*°. To test this, we used bioassay experiments that 
measure the growth rates of individuals in populations exhibiting 
each type of cycle. The results were consistent with predictions. 
Together, the new theory and experiments establish that two very 
general features of consumers—a resource-dependent juvenile 
stage duration and resource-dependent mortality—combine to 
produce small-amplitude resource-consumer cycles. This phe- 
nomenon may contribute to the prevalence of small-amplitude 
fluctuations in many other consumer-resource populations®”. 

Incorporating size- or age-structured interactions into theory on 
the dynamics of biological populations has provided new insight into 
the causes of population fluctuations and patterns in natural systems. 
Theory on single-species dynamics reveals that interactions among 
life-history stages or age classes and the resultant feedback on demo- 
graphic rates** can generate a diversity of cycle types. Many of these 
findings carry over to situations with indirect interactions among 
stages, for example through competition for food. 

The range of possible mechanisms for explaining cyclic dynamics 
expands markedly once models include explicit interactions with other 
species. For example, stage-structure and explicit consumer—resource 
interactions can each destabilize an equilibrium, leading to population 
cycles*?"'. Intuitively one might think that combining these sources of 
instability would enhance instability in dynamics. Here, we dem- 
onstrate that this combination does not simply increase the amplitude 
of cycles, but introduces new phenomena that arise from the inter- 
action of the various sources of instability. We show that combining 
one of the most general life histories for a predator with consumer— 
resource dynamics produces new types of cycles that may help to 
explain the rarity of large-amplitude cycles in many systems. 

Our approach combines theory on the dynamics of stage- 
structured populations of the herbivore Daphnia pulex and their algal 
prey with experiments conducted in microcosms to test mechanisms 
producing different types of cycles. By coupling small-volume 


flow-through systems to microcosms containing the Daphnia—algal 
populations, we measure the growth rate of individual herbivores 
experiencing food fluctuations set by the dynamic interactions 
unfolding in the microcosms, and compare these estimates to model 
predictions for different types of cycles. 

Previous work*''” has identified three types of cycles that can 
emerge from stage-structured competition for food in a population 
being reared under semi-chemostat conditions. Which cycles are 
found depends on the relative feeding rates of juveniles and adults, 
fecundity, and relative ability to withstand starvation of juveniles and 
adults. Here we add to the previous models two general features of 
natural consumer-—resource systems: density-dependent dynamics of 
resources and a saturating functional response for the consumers. 
This broadens the range of potential outcomes, including previously 
undiscovered types of cycle, and defines new empirical tests using 
consumer-resource systems, reported below. Our analysis exploits a 
new numerical bifurcation approach for stage-dependent delayed 
integro-differential models that facilitates resolution of regions of 
hysteresis, and allows us to identify robust properties of the different 
types of cycles that emerge. 

Our stage-structured model for resource—consumer dynamics (Box 
1 and Supplementary Information) is motivated by the extensive work 
on dynamics of Daphnia populations interacting with algal prey. 
Juvenile and adult predators (stages of Daphnia) differ in their feeding 
rates, energy allocation patterns, and resource-dependent mortality. 
We assume that the prey population is unstructured and prey cannot 
evolve. The resulting model that describes interactions of a ‘struc- 
tured’ predator population with a simple prey has received relatively 
little previous attention’, in contrast with a large body of theory that 
includes ‘structure’ in the prey or resource’*"*, used in studies of host— 
parasitoid interactions or disease dynamics’. Yet it contains the key 
features of many systems where prey complexity can be controlled. 

Empirically, the Daphnia—algal system displays an extraordinary 
range of dynamics that includes fluctuations around a stable equi- 
librium, and a combination of small- and large-amplitude 
cycles*®”!”, As predicted by previous theory, large-amplitude con- 
sumer-—resource cycles can be found under nutrient-rich conditions, 
the so-called ‘paradox of enrichment’*. However, in these same envir- 
onmental conditions, small-amplitude cycles not predicted by pre- 
vious theory were present and there is evidence that these periodic 
attractors coexist*’. This suggests that there is some fundamental 
feature of this system that produces these small-amplitude cycles 
even under nutrient-rich conditions where only large-amplitude 
cycles are expected. 

Bifurcation analysis of the stage-structured predator—prey model 
parameterized for the Daphnia-algal system (Fig. 1 and 
Supplementary Information) reveals new dynamics as the algal car- 
rying capacity is increased. In environments with low algal carrying 


lEcology and Evolution Group, Department of Biological Sciences, University of Calgary, Calgary, Alberta T2N IN4, Canada. *Department of Biology, Queen's University, Kingston, 
Ontario K7L 3N6, Canada. *Department of Ecology, Evolution, and Marine Biology, University of California, Santa Barbara, California 93106, USA. 
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Box 1| A structured model of the dynamics of a consumer and its 
resource 


Population balance equations are shown for processes and 
interactions, along with equations for juvenile development and 
survival. The top three equations describe the dynamics of resource (F) 
and juvenile (J) and adult (A) consumers, respectively. The dynamics 
of juvenile development time and juvenile survival are determined by 
integral equations (bottom two equations). Further model details, 
parameter definitions and estimation, and bifurcation analysis is 
provided in Supplementary Information. 

Resource dynamics: 
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capacity the model has a stable equilibrium, as is found in unstruc- 
tured predator-prey models. Increasing algal carrying capacity 
beyond ~0.3 mg C1” ', we encounter a bifurcation near which two 
cycles appear: a new small-amplitude resource-consumer cycle and, 
via a subcritical fold of periodic orbits, a large-amplitude resource— 
consumer cycle. Notably, the two cycles are coexisting attractors over 
a broad range of algal carrying capacities corresponding to nutrient- 
rich environments (Fig. la). Bifurcation analysis also reveals that the 
cycles are separated by an unstable limit cycle. This causes the trans- 
ition between the large- and small-amplitude cycles to occur abruptly 
(Supplementary Information), which is a common feature of labor- 
atory and field experiments. 

The two types of cycles can be distinguished by a key diagnostic 
feature’*": the ratio of the cycle period to juvenile stage duration 
(hereafter referred to as the delay). This is illustrated in Fig. 1b—e. In 
the large-amplitude cycles (Fig. 1b, c), the period exceeds the delay, a 
property that is characteristic of previously studied resource—con- 
sumer systems. By contrast, in the small-amplitude cycle (Fig. 1d, e), 
the juvenile stage duration is always larger than the cycle period, 
yielding cycles with a period/delay ratio less than one. The small- 
amplitude cycles are caused by the interaction between the instability 
caused by the resource-consumer interaction and the resource- 
dependent juvenile stage duration in the consumer Daphnia. 
Numerical experiments (not shown) established that removing the 
type II functional response that is the source of the instability in 
nutrient-rich algal environments, decoupling the prey dynamics by 
replacing density-dependent growth with semi-chemostat dynamics, 


LETTERS 


or fixing the juvenile stage duration at time periods longer than the 
cycle period, all cause the coexisting attractors to disappear. 

Our structured model thus apparently explains one of the major 
empirical results reported previously*: the Daphnia-—algal system can 
display both large- and small-amplitude cycles under the same nutri- 
ent-rich environment. This hypothesis can be rigorously tested by 
measuring the period/delay ratio in dynamic interacting popula- 
tions. There are some major challenges in measuring the develop- 
ment time of individuals (stage durations) non-destructively in a 
population where individuals cannot be marked to follow their 
growth or performance. Some attempts to dye individual Daphnia 
have been tried, but feeding performance is affected. We overcame 
these challenges by designing new bioassay experiments whereby we 
can measure growth of individuals in populations experiencing food 
dynamics set by the interaction between predators and prey (Fig. 2 
and Supplementary Information). Estimates of growth rates from 
these bioassays were validated through comparisons with ‘artificial 
cohort’ experiments (Supplementary Information). 

Replicate Daphnia—algal systems display the same collection of 
dynamics described previously*. Under common environmental 
conditions of nutrient enrichment, temperature and light levels, 
the predator—prey systems display either large-amplitude or small- 
amplitude cycles, with transitions between the attractors (Fig. 2 and 
Supplementary Information). Consistent with dynamics presented 
previously*, small-amplitude cycles predominate (~80% of the 
replicates) when resting egg production is not manipulated. 
Large- and small-amplitude cycles can be distinguished using two 
probes: variation in Daphnia egg density and in algal biomass (Fig. 3 
and Supplementary Information). The cycle periods for these 
dynamics average 21.4 days with lower and upper 95% prediction 
intervals of 14.5 and 28.2, respectively (Fig. 4). The bioassay experiments 
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Figure 1| Multiple limit-cycle attractors in the structured predator-prey 
model. a, Bifurcation diagram showing the transition from a stable steady 
state (solid black line) to a region of multiple coexisting limit cycles with 
increasing algal carrying capacity K (mg C1 '). The range in algal density 
(mg C1’) over a cycle is shown. Stable small-amplitude cycles (blue) and 
large-amplitude cycles (red) are shown, separated by an unstable cycle 
(dashed blue line). b, d, Large- and small-amplitude cycles of Daphnia 
(black) and algae (grey). ¢, e, Graphs showing a key diagnostic feature: the 
relationship between cycle period (dashed line) and the stage duration of 
Daphnia (solid line) during large- (c) and small-amplitude cycles (e). 
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Figure 2 | Bioassay experiments for resource-consumer systems. 
Distinguishing among causes of dynamics often requires observations on the 
performance of individuals in interacting populations. For many 
populations, individuals cannot be marked or followed, which restricts the 
estimation of key rates. Using flow-through systems coupled to microcosms 
(upper panel), we can estimate these rates for individuals. As the dynamics 
unfold in the microcosm, water containing algal prey flows through the vial 
housing an individual whose growth and reproduction can be observed 
during cyclic dynamics. Lower panels illustrate examples of large- and small- 
amplitude dynamics from which individuals were drawn to estimate length- 
specific growth rates. 


(Fig. 2) show that individual Daphnia grow at different rates in the 
large-amplitude and small-amplitude cycles (Fig. 4a,b), and thus 
the development time bears a fundamentally different relationship 
with the cycle period. The juvenile stage duration of individual 
Daphnia developing in the large-amplitude cycle is on average 
7.2 days, yielding a cycle period/delay ratio of ~3. During the 
small-amplitude cycles, juvenile stage durations were estimated to 
be 42 days (lower 95% prediction limit 37 days; upper limit 48 days), 
which is significantly longer than the cycle period, producing a frac- 
tional period/delay ratio. Independent experiments that estimate the 
length-specific growth rates of juvenile Daphnia in populations at 
equilibrium (Supplementary Information) yield comparable values 
to those obtained from the individual bioassay experiment (Fig. 4b). 
Thus, two key predictions from theory were supported by results 
from these new experiments. First, coexisting attractors were 
observed in the same enrichment environment. Second, the stage 
durations of individual Daphnia were significantly shorter than the 
cycle period in the large-amplitude cycle, and significantly longer 
than the cycle period in the small-amplitude cycle (Fig. 4). To our 
knowledge, this is the first empirical support for cycles with a frac- 
tional period/delay ratio. 

Theory on the dynamics of structured populations has provided 
new explanations for cycles and has enabled investigators to identify 
mechanisms giving rise to cycles of various types. Our experiments 
and measurements of individual performance in populations reveal a 
new type of small-amplitude cycle that arises from the interaction of 
instability in the predator—prey interaction causing fluctuations in 
resource density and a dynamically varying life-history feature of the 
predator. This interaction constrains the amplitude of the cycle even 
under nutrient-rich conditions where large-amplitude cycles prevail 
in either unstructured predator-prey models (for example, 
MacArthur—Rosenzweig models) or structured models with fixed 
stage durations for predators. The new cycles require the interaction 
of effects to emerge. 
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Figure 3 | Egg density dynamics during cycles. Dynamics of egg density in 
Daphnia populations displaying large-amplitude cycles (a) and small- 
amplitude cycles (b—d). The red line shows large-amplitude cycles as 
characterized by the fecundity probe (see Supplementary Information). The 
blue line shows small-amplitude cycles. The coefficient of variation (CV) in 
algal biomass is also presented. 


Simple models containing nonlinearities in biological processes can 
produce phenomena such as alternative stable states or coexisting 
attractors, complex transients as dynamics move from the influence 
of one basin of attraction to another, and a range of cycles. A key 
question is whether these ‘rich’ results from nonlinear models actually 
provide fundamental insight into the dynamics of ecological 
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Figure 4 | Comparison of individual growth rates in large- and small- 
amplitude cycles. Length-specific growth rates in large-amplitude cycles 
(a) and small amplitude cycles (b) yield different estimates of juvenile stage 
duration (c; Supplementary Information). Diamonds show estimates from 
artificial cohort experiments (Supplementary Information). In large- 
amplitude cycles, juvenile stage duration (c, red) is significantly less than 
cycle periods (c, grey). In small-amplitude cycles, the juvenile duration is 
significantly greater than the cycle period (c, blue). Dashed lines show 95% 
prediction intervals (Supplementary Information). 
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systems”. As we confront model predictions with well-studied experi- 
mental systems, where direct measurements of rates and manipula- 
tion of environmental conditions can be coupled quantitatively to 
models, more often than not, model expectations are upheld and 
general processes underlying the dynamics are being revealed. 
Structured population models of ecological interactions in particular 
are capturing the essence of dynamics of several ‘model’ systems (for 
example, Tribolium’’, rotifers’, Daphnia and Bosmina’’, California 
redscale*, Arctic char and brown trout~’). These models predict the 
occurrence of different dynamics as we move from one environment 
to the next, and identify key processes that lead to the different types of 
fluctuations. Although the models are costly to parameterize, the 
mechanisms apply to a broad range of organisms that share general 
life-history features, such as resource-dependent development rates 
and stage-specific interactions dictated by allometric scaling of energy 
acquisition and expenditure. Using these confrontations between the- 
ory and experiments, we are discovering important linkages between 
physiological ecology and life-history traits that explain dynamics of 
interacting populations in different environments. 


METHODS SUMMARY 

Daphnia-algal dynamics. Microcosms were established using techniques 
described previously* and maintained in a constant environment at 24°C with 
a 14:10 light cycle. Replicates were inoculated with a single clone of Daphnia 
pulex used in the previous experiments on food-dependent growth and repro- 
duction™. Initial nutrient enrichment with inorganic phosphorus and nitrogen 
placed the algal carrying capacity deep in parameter space where large-amplitude 
predator-prey cycles are predicted by theory*. Techniques for nutrient regen- 
eration, non-destructive sampling of Daphnia populations, and chlorophyll 
analysis were carried out as described previously*. Single clones of each algal 
species from the University of Toronto Culture Collection were used. Biweekly 
samples were taken and no manipulations of Daphnia egg production were 
performed. An accurate estimate of periodicity in the population dynamics 
can be obtained by measuring the time period between reproduction bursts of 
fecund females in the population (Supplementary Information). Dynamics from 
24 tanks were used to estimate this periodicity to compute the distribution 
shown in Fig. 4c. 

Bioassay experiments. To estimate stage duration, we drew individuals of vari- 
ous sizes from Daphnia populations executing different types of cycles (large- 
amplitude and small-amplitude cycles) (Supplementary Information). One 
individual was placed in each flow-through vial, and each peristaltic pump could 
supply and remove water and food for ten replicate vials. A typical (1 mm) 
juvenile Daphnia, living at 24°C, can filter at most 2mlh_! (ref. 24). The flow 
rate through each vial was 40 mlh_', ensuring that the individual could not 
significantly alter the food concentration. Each individual experienced the same 
food concentration as its counterparts in the population of Daphnia in the 
microcosm. Individuals were drawn from populations during the period of 
egg production and followed for a minimum of 3 days, for individuals 
<1.4mm, and 4 days for individuals >1.4 mm. An individual contributed only 
one estimate (that is, no repeated measures). Length-specific growth rates were 
calculated as In(L,/Lo)/time and regressed against length (mm) using nonlinear 
regression. Using these regression equations, the juvenile stage durations and 
95% prediction intervals were estimated by determining the time required to 
grow from the size at birth (0.69 mm) to the size at first reproduction (1.6 mm) 
(Supplementary Information). 
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GILT is a critical host factor for Listeria 


monocytogenes infection 


Reshma Singh'’”, Amanda Jamieson't & Peter Cresswel 


Listeria monocytogenes is a Gram-positive, intracellular, food- 
borne pathogen that can cause severe illness in humans and ani- 
mals. On infection, it is actively phagocytosed by macrophages’; it 
then escapes from the phagosome, replicates in the cytosol, and 
subsequently spreads from cell to cell by a non-lytic mechanism 
driven by actin polymerization’. Penetration of the phagosomal 
membrane is initiated by the secreted haemolysin listeriolysin O 
(LLO), which is essential for vacuolar escape in vitro and for viru- 
lence in animal models of infection®. Reduction is required to 
activate the lytic activity of LLO in vitro**, and we show here that 
reduction by the enzyme y-interferon-inducible lysosomal thiol 
reductase (GILT, also called Ifi30) is responsible for the activation 
of LLO in vivo. GILT is a soluble thiol reductase expressed con- 
stitutively within the lysosomes of antigen-presenting cells”*, and 
it accumulates in macrophage phagosomes as they mature into 
phagolysosomes’. The enzyme is delivered by a mannose-6- 
phosphate receptor-dependent mechanism to the endocytic path- 
way, where amino- and carboxy-terminal pro-peptides are cleaved 
to generate a 30-kDa mature enzyme”*”. The active site of GILT 
contains two cysteine residues in a CXXC motif that catalyses the 
reduction of disulphide bonds”*. Mice lacking GILT are deficient 
in generating major histocompatibility complex class-I]-restricted 
CD4* T-cell responses to protein antigens that contain disulphide 
bonds'”*. Here we show that these mice are resistant to L. mono- 
cytogenes infection. Replication of the organism in GILT-negative 
macrophages, or macrophages expressing an enzymatically inac- 
tive GILT mutant, is impaired because of delayed escape from the 
phagosome. GILT activates LLO within the phagosome by the thiol 
reductase mechanism shared by members of the thioredoxin 
family. In addition, purified GILT activates recombinant LLO, 
facilitating membrane permeabilization and red blood cell lysis. 
The data show that GILT is a critical host factor that facilitates 
L. monocytogenes infection. 

GILT is the only known thiol oxidoreductase present in phago- 
somes and we speculated that it might activate LLO in vivo. 
Consistent with this idea, when we infected wild type and GILT- 
deficient mice with L. monocytogenes we observed more rapid bac- 
terial clearance from the spleen and liver of the GILT-deficient mice 
(Fig. 1a). Initial infection was as effective as in wild-type mice on the 
basis of the similar bacterial colony-forming units (c.f.u.) observed at 
day one after infection. No difference in the clearance rates of 
Salmonella typhimurium was seen between wild type and GILT 
knockout mice (data not shown). To determine if the difference in 
clearance was a function of defective intracellular growth, we exa- 
mined bacterial growth in bone-marrow-derived macrophages in 
vitro. Replication of L. monocytogenes was clearly impaired in macro- 
phages from GILT-deficient mice (Fig. 1b). This was not due to an 
inherent defect in phagocytosis or bacterial killing by GILT /~ 


j-2 


macrophages, which cleared an in vitro infection by non-pathogenic 
Escherichia coli as efficiently as wild-type macrophages (Supple- 
mentary Fig. 1). Nor was it due to a problem in phagosomal 
acidification—acidification of both lysosomes and phagosomes 
occurred with the same kinetics in GILT ‘~ and wild-type macro- 
phages (Supplementary Fig. 2). 

Because GILT is active in phagosomes it seemed likely that the 
growth deficiency was due to a defect in phagosomal escape. Once 
in the cytosol, L. monocytogenes polymerizes host actin, which allows 
movement through the cytosol and facilitates intercellular spread’, 
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Figure 1| Growth of L. monocytogenes is decreased in GILT-deficient mice 
and GILT-deficient macrophages. a, In vivo colony counts of L. monocytogenes 
(strain 10403s) injected intravenously in wild-type (WT) C57BL/6 or GILT- 
deficient mice. At each time point three mice per group were killed and spleens 
and livers were harvested each day for bacterial colony counts to obtain 

c.f.u.g |. A representative of three individual experiments is shown. b, In vitro 
growth of L. monocytogenes in bone-marrow-derived macrophages from wild- 
type C57BL/6 and GILT-deficient mice. The cells (10° per well) were lysed at 
each time point shown and plated to obtain the c.f.u. A representative of three 
individual experiments is shown. Asterisk, P< 0.05; double asterisk, P< 0.01. 
Error bars in a and b indicate mean + standard error. 
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and actin polymerization provides a convenient assay for cytosolic 
entry. Indeed, we saw a delay in actin polymerization, detected by 
phalloidin staining, in infected GILT-deficient macrophages, con- 
sistent with impaired escape from the phagosome (Fig. 2a, b). In 
wild-type macrophages actin polymerization was observed as early 
as 30min after infection. This typically results in overwhelming 
infection and cell death within 36 to 48h. In GILT-deficient macro- 
phages, actin polymerization was not observed until 6-8 h. When 
wild type or GILT-deficient macrophages were infected with an 
LLO-negative L. monocytogenes strain the rate of actin polymeriza- 
tion was reduced to a level below that seen with wild-type bacteria in 
the GILT-negative cells (Fig. 2a, b). Consistent with a role for GILT in 
L. monocytogenes infection, a marked increase in cytosolic access was 
observed on infection of the GILT-negative human promonocytic 
cell line THP-1 when human GILT was expressed by retroviral trans- 
duction’’ (Supplementary Fig. 3). GILT is induced in THP-1 cells on 
exposure to bacteria, but more than 24h is required’. 

To confirm the escape defect we examined infected cells by elec- 
tron microscopy (Fig. 2c). After 2h virtually no bacteria were seen 
outside of membrane-bound phagosomes in GILT-deficient macro- 
phages, and only limited actin polymerization was observed even 8h 
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after infection. Furthermore, defective phagosomal escape, detected 
by the delay in the induction of actin polymerization, was reversed 
when GILT was expressed in GILT-negative macrophages by retro- 
viral transduction (Fig. 2d, e). However, expression of a double 
cysteine mutant of GILT (C69S/C71S), lacking both active site 
cysteine residues and unable to catalyse disulphide bond reduction", 
failed to reverse the escape defect. 

LLO has a single cysteine residue at position 485 and a mutant with 
an alanine substitution at that position does not require reduction for 
activation’. To determine if the mutation also reversed GILT 
dependence, we repeated the in vitro infection experiments described 
above with an L. monocytogenes strain expressing this LLO variant. 
For this organism growth was equivalent in wild type and GILT '~ 
macrophages (Fig. 3a), as was the rate of escape from the phagosome 
(Fig. 3b, c). We also examined the capacity of recombinant enzyma- 
tically active precursor GILT*'*”* to induce lytic activity in purified 
recombinant LLO (characterized in Supplementary Fig. 4). GILT 
clearly was able to activate LLO, determined by lysis of purified 
bone-marrow-derived macrophages (Fig. 3d) and haemolysis of 
sheep red blood cells (Fig. 3e). Similar lytic curves for macrophages 
were obtained using trypan blue exclusion (data not shown). Lysis 
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Figure 2 | Phagosomal escape of L. monocytogenes is delayed in GILT- 
negative macrophages. a, Actin polymerization in wild type and GILT 
knockout macrophages (M®) infected with wild-type L. monocytogenes or 
ALLO L. monocytogenes. Both strains of bacteria express GFP and are green. 
Phalloidin staining of actin is in red. b, Quantification of actin 
polymerization. A total of 1,000 cells per sample was counted for each time 
point. The data derive from the average of three experiments, and statistical 
significance was assessed by comparing the infection of wild type and 
GILT '~ macrophages with wild-type bacteria. Asterisk, P< 0.05; double 
asterisk, P< 0.01. Error bars indicate mean + standard error. 

c, Transmission electron micrographs of wild type or GILT ’ -infected 
macrophages. The arrowheads indicate intact phagosomal membrane, and 
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the reordering of the cytoplasm in the initial stages of actin polymerization 
can be seen around bacteria in the cytosol. Scale bar, 1 1m. No arrowheads 
are present in the 8h image for wild-type cells because no bacteria are 
contained within membrane-bound compartments. d, Actin polymerization 
in infected wild type and GILT-negative macrophages, and GILT-negative 
macrophages expressing GILT or inactive mutant GILT (C69S/C71S). 

e, Quantification of actin polymerization in macrophages infected as in 

d. The data derive from the average of three experiments and statistical 
significance was assessed by comparing the wild type and untransduced 
GILT ‘~ macrophages and, separately, the GILT ‘~ macrophages 
transduced with wild type and mutant GILT retroviruses. Asterisk, P< 0.05; 
double asterisk, P< 0.01. Error bars indicate mean + standard error. 
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was abrogated if the enzymatic activity of GILT was eliminated by 
pre-treatment with the thiol-reactive reagent N-ethylmaleimide 
(NEM). 

GILT shares with thioredoxin a reduction mechanism in which the 
N-terminal cysteine residue in the CXXC active site reduces a substrate 
disulphide bond by a nucleophilic attack on one of the involved 
cysteine residues. This generates a disulphide-linked mixed enzyme— 
substrate intermediate that is rapidly resolved by an attack of the 
second active site cysteine residue on the first”*. The second step in 
the reaction, known as the escape pathway’, can be prevented by 
mutation of the second active site cysteine, generating a ‘trapping 
mutant’ that allows mixed enzyme-substrate dimers to be isolated. 
To determine whether GILT uses this mechanism to activate LLO, 
bone-marrow-derived macrophages from wild type and GILT /~ 
mice were again infected with L. monocytogenes, together with 
GILT ‘~ macrophages that were retrovirally transduced with a 
GILT trapping mutant (C71S). Infected cells were lysed in detergent 
and immunoprecipitated GILT was subjected to SDS—polyacrylamide 
gel electrophoresis (PAGE) followed by western blotting to detect 
associated LLO. LLO co-precipitated with GILT only from infected 
macrophages expressing the trapping mutant (Fig. 4a). Immuno- 
fluorescence microscopy confirmed that in wild-type macrophages 
the organism co-localizes with GILT in LAMP-1-positive phagosomes 
(data not shown). Thus, GILT uses the classical thiol reductase mecha- 
nism to activate LLO in phagosomes and initiate the escape of 
L. monocytogenes to the cytosol. 

In the absence of LLO, L. monocytogenes may eventually escape 
from the phagosome, just as in the absence of GILT (Fig. 2a). 
However, when active GILT is present escape of LLO-positive 
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bacteria is extremely rapid. The identity of the disulphide bond tar- 
geted by GILT is unclear. The single cysteine present in LLO lies in a 
short tryptophan-rich sequence that initiates pore formation on bind- 
ing to membrane cholesterol. It has been suggested that a small thiol- 
containing molecule is disulphide linked to the cysteine residue, and 
that this inhibits activation of the haemolysins’’. The equivalent 
cysteine residue in perfringolysin O (PFO), a related haemolysin 
derived from Clostridium perfringens, is not necessary for binding to 
cholesterol in the membrane, but is necessary for the formation of a 
pre-pore complex'**. GILT-mediated exposure of the critical 
cysteine residue in LLO may facilitate a conformational change that 
allows the formation of the pre-pore complex and full activation. 
Notably, the ability of GILT to activate haemolysins is not limited 
to LLO: GILT can also activate streptolysin O (SLO), derived from 
Streptococcus pyogenes, as measured by the haemolysis of sheep red 
blood cells, but it is not required to activate an SLO mutant that lacks 
the characteristic single cysteine residue”’ (Supplementary Fig. 5). 

In most cell types GILT can be induced by interferon (IFN)-y, and 
IFN-y induction during the early stages of infection may enhance the 
ability of L. monocytogenes to infect other cells, including hepatocytes 
where it also replicates in vivo’*”*. The ability of GILT to activate SLO 
as well as LLO suggests that it may activate other members of this 
highly conserved family of haemolysins. Phagocytosis of haemolysin- 
expressing organisms may not be essential for activation of the lytic 
activity, as we have recently found that Toll-like receptor (TLR)- 
mediated activation of macrophages by E. coli lipopolysaccharide 
induces secretion of the enzymatically active precursor form of 
GILT’ that activates SLO in vitro (Supplementary Fig. 5). GILT 
is functional even at neutral pH, retaining approximately 30% of the 
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Figure 3 | LLO is activated by GILT reduction during infection and in cell- 
free assays. a, In vitro growth of wild type and C485A LLO mutant 

L. monocytogenes in wild type and GILT ‘~ macrophages. The experiment, 
performed as described in Fig. 1b, was repeated three times and a 
representative experiment is shown. Asterisk, P < 0.05; double asterisk, 
P<0.01. b, Actin polymerization in wild type and GILT ‘~ macrophages 
infected with wild-type L. monocytogenes or C485A LLO mutant 

L. monocytogenes. The bacteria were detected by immunofluorescence 
(green) and phalloidin staining of actin is in red. c, Quantification of actin 
polymerization as described in Fig. 2b. The data represent an average of three 
independent experiments; all points after 0.5 h are statistically significant to 
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P<0.01. d, Viability of GILT-negative macrophages assessed by propidium 
iodide staining after incubation of 1 X 10° cells with 300 ng LLO for 

30-45 min. Cells were incubated alone, with activated GILT (1 1g) and 

25 uM dithiothreitol (DTT), with unactivated LLO and 25 uM DTT, with 
DTT-activated LLO, or with LLO and 1 hig, 100 ng or 10 ng activated GILT. 
The data represent the average of three independent experiments and 
P-values for the samples with significant differences are shown. e, Haemolytic 
activity of purified LLO pre-incubated with active GILT or GILT inactivated 
by NEM treatment. The data represent the average of three independent 
experiments and P values for the samples with significant differences are 
shown. All error bars indicate mean + standard error. 
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Figure 4| GILT activates LLO by using the classical thioredoxin reduction 
mechanism. Wild-type macrophages, GILT ‘~ macrophages, or GILT '~ 
macrophages reconstituted with the C71S GILT trapping mutant were 
uninfected or infected with either wild-type L. monocytogenes or ALLO 

L. monocytogenes for 2h before detergent solubilization, 
immunoprecipitation (IP) and SDS-PAGE. Top panel: western blot of LLO 
in the detergent extracts. Middle panel: western blot with an LLO-specific 
antiserum of immunoprecipitates isolated from the extracts using a control 
monoclonal antibody (left three lanes) or an anti-GILT monoclonal 
antibody (right nine lanes). LLO is only co-precipitated with the C71S 
trapping mutant. Bottom panel: western blot for GILT in the 
immunoprecipitates. 


activity seen at pH 4.5 (refs 7 and 8) and it has been reported that the 
local pH can be as low as 5.7 at sites of bacterial infection’*. Secretion 
of GILT by macrophages at such a site could facilitate local haemo- 
lysin-mediated tissue damage, perhaps including lysis of inflammat- 
ory cells recruited for the purpose of host defence. 


METHODS SUMMARY 


The L. monocytogenes strains used are listed in the Methods. Infections in C57BL/6 
and C57BL/6 GILT knockout mice were performed as described*®. Bone-marrow- 
derived macrophages were prepared as described”. In vitro infections were done 
in the presence of gentamicin to prevent extracellular bacterial growth, and at a 
multiplicity of infection (MOI) of 5 (unless otherwise stated in the Methods). 

Electron microscopy and immunofluorescence experiments and antibodies 
used are as described in the Methods. Co-localization and quantification were 
performed by direct visualization on a Leica DMIRE2 confocal microscope. 
Figure assembly was done with Adobe Photoshop and Adobe Illustrator. 

LLO protein was purified according to published protocols with modifica- 
tions described in the Methods’. The mean + standard error is shown in 
the figures and P values were calculated using a two-tailed two-sample equal 
variance Student’s t-test. A P value of less than 0.05 was determined to be 
statistically significant. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Bacterial strains. Listeria monocytogenes was grown in brain heart infusion 
(BHI) broth (BD). The strains used were: wild type (10403s), L. monocyto- 
genes-GFP (1039), L. monocytogenes ALLO (1039ALLO) (gifts from 
H. Agaisse)** and DP-L4391 (C485A LLO) (a gift from D. Portnoy). LLO was 
purified from the DP-E3570 E. coli strain (donated by D. Portnoy) grown in 
Luria-Bertani (LB) broth (BD) supplemented with kanamycin. Production of 
LLO was induced by 1mM isopropyl-B-p-thiogalactopyranoside (IPTG) at 
30 °C. E. coli used for infection was grown in LB supplemented with ampicillin. 
In vivo bacterial infections. Infections of C57BL/6 and C57BL/6 GILT-deficient 
mice were performed as described previously”. Six-eight-week-old mice were 
used for each experiment. 

Cell culture. Bone-marrow-derived macrophages were cultured as described”. 
Briefly, bone marrow was harvested from the femurs of 8—10-week-old mice and 
cells cultured for 5-6days in RPMI 1640 containing 20% fetal calf serum 
(Hyclone), 100unitsml~’ penicillin (GIBCO), 100,gml~' streptomycin 
(GIBCO), 10mM HEPES (GIBCO), 1% non-essential amino acids (GIBCO), 
2mM L-glutamine (GIBCO), 1mM sodium pyruvate, and 0.035% B-mercap- 
toethanol, supplemented with 10 ng ml! granulocyte-monocyte colony stimu- 
lating factor. HEK 293 cells were grown in DMEM with 10% bovine calf serum 
(Hyclone). 

In vitro infections. Infections of macrophages for in vitro growth assays were at 
an MOI of 5 except for Fig. 1b where an MOI of 0.1 was used and Fig. 3a when an 
MO1 of 0.5 was used. For infection, 16h L. monocytogenes cultures were diluted 
1:10 in fresh BHI and grown for an additional 2 h at 37 °C with shaking. Aliquots 
of mid-log phase bacteria (~5 x 10° c.f.u. ml ') were washed once in PBS and 
used to infect macrophages in medium without antibiotics. After 30 min at 37 °C 
gentamicin (50 jig ml ') was added and the cells incubated for another 30 min at 
37 °C, washed, and incubated in 1 ml fresh medium at 37 °C. Cells were lysed at 0, 
2, 4, 6, 8, 10, 12 and 24h in 1 ml water for 5—10 min. Serial dilutions were plated 
on BHI plates containing chloramphenicol and colonies were counted the next 
day to determine c.f-u. 

Transmission electron microscopy and immunofluorescence. For transmis- 
sion electron microscopy, cells were fixed in 0.2% glutaraldehyde in 0.1 M caco- 
dylate buffer for 1h at room temperature and processed as described™*. For 
immunofluorescence, macrophages were plated on coverslips 12 h before infec- 
tion. The cells were fixed in 2% glutaraldehyde for 20 min at room temperature 
at each time point. For phalloidin staining, the coverslips were washed in PBS 
and permeabilized with 0.1% Triton X-100 in PBS for 3 min. After washing, 
methanolic phalloidin stock solution was diluted 40x into 1% BSA in PBS 
and the coverslips stained for 20 min at room temperature. Intracellular anti- 
body staining was done after cell permeabilization in 0.1% saponin for 20 min at 
room temperature. Primary antibodies were added for 30 min and secondary 
antibodies for a further 30min. The antibodies used were: R.mGILT", 
MaP.mGILT6 (mouse anti-mouse GILT monoclonal antibody)”, phalloidin 
conjugated to Alexa 546 (Molecular Probes), and rabbit anti-Listeria sp. FITC 
(Affinity BioReagents). Alexa Fluor 546 and 633 conjugated secondary antibo- 
dies were used (Molecular Probes). Coverslips were mounted using ProLong 
Gold mounting solution (Molecular Probes). 

Retroviral constructs and spinfection. Wild-type mouse GILT cDNA was 
cloned into the pLPCX vector (Clontech) using BglII and HindIII restriction 
enzymes. GILT mutants were made by site-directed mutagenesis with the fol- 
lowing primers: C69S/C71S, 5'-GAGTCCCTGTCCGGAGCTAGCCGCTACTT 


nature 


CCTCCG-3', and C718, 5’-CCCTGTGTGGAGCTAGCCGCTACTTCCTC-3’. 
To produce retrovirus, HEK 293 cells were transfected with 12 ug of each pLPCX 
construct and 12 jtg pCL-Eco with 60 pl Lipofectamine 2000 (Invitrogen). After 
12 h the medium was changed to macrophage culture medium and the cells were 
shifted to 32 °C. After 24h filtered supernatant was added to day 2 macrophage 
cultures. The cells were spinfected at 32°C, at 2,900 r.p.m. for 90min and 
cultured at 37 °C and differentiated as usual. 

LLO purification. Purification of LLO was as described except that the B-mer- 
captoethanol was omitted from all the buffers and 0.5 mM DTT was used for 
storage’. 

Cell lytic assays. Precursor GILT, purified from supernatants of baculovirus- 
infected insect cells as described’, was activated with 25uM DTT at room 
temperature for 10min, and for some experiments inactivated with NEM 
(1.5 mM) followed by dialysis against normal saline (pH 5.5). It was then incu- 
bated with LLO at 37 °C for 30 min. An aliquot of LLO was activated using DTT 
(2 mM) as a positive control. The samples were incubated with macrophages at 
37 °C for 30 min. Viability was assessed by propidium iodide staining and FACS 
analysis. For haemolysis assays LLO was added to sheep red blood cells 
(Innovative Research) in PBS, pH5.5, in 96-well plates on ice. After 30 min at 
37 °C lysis was determined spectrophotometrically, with Triton X-100 lysis ser- 
ving as 100% release. One haemolytic unit (HU) is the amount of toxin that 
releases half the haemoglobin. SLO (Aalto Bio Reagents Ltd.) was activated by 
4mM DTT. Purified SLO with a cysteine to alanine mutation at position 530 
(ref. 21) was a gift from N. Andrews. 

LLO co-immunoprecipitation. Macrophages were infected with L. monocyto- 
genes for 2 h and extracted in Tris-buffered saline, pH 7.4, containing 1% Triton 
X-100, protease inhibitors (Roche), and 10 mM methyl methanethiolsulphonate 
for 30 min on ice. GILT was immunoprecipitated using the monoclonal anti- 
body MaP.mGILT6, the samples separated by reducing SDS-PAGE and LLO 
detected by western blot using a rabbit anti-LLO antibody and goat anti-rabbit 
immunoglobulin conjugated with horseradish peroxidase (HRP). An anti-H2- 
K® monoclonal antibody, Y-3, was used asa control. The blot was developed with 
ECL reagents. 

Lysosomal and phagosomal acidification. Lysosomal pH was determined after 
uptake of Oregon-green-labelled dextran (10,000Da molecular mass, 
25 Lg ml~!; Molecular Probes) over 45 min on ice as described2”*°. Cells were 
extensively washed in PBS at neutral pH and read on a SpectraMax M5 plate 
reader (Molecular Devices), and analysed with SoftmaxPro software (Molecular 
Devices). Fluorescent emission at 520 nm was measured with alternating excita- 
tion at 450nm and 490 nm for 40 min. Conversion of the 450/490 excitation 
ratio to pH was calculated based on a standard curve generated using excitation 
ratios of the dextran in standard buffers. Phagosomal pH was determined after 
the uptake of 3 jtm carboxy beads (Polysciences Inc.) covalently labelled with 
carboxyfluorescein-SE (Molecular Probes) using a similar approach. 
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Non-random segregation of sister chromosomes in 


Escherichia coli 


Martin A. White’, John K. Eykelenboom’, Manuel A. Lopez-Vernaza’, Emily Wilson’ & David R. F. Leach’ 


It has long been known that the 5’ to 3’ polarity of DNA synthesis 
results in both a leading and lagging strand at all replication forks’. 
Until now, however, there has been no evidence that leading or 
lagging strands are spatially organized in any way within a cell. 
Here we show that chromosome segregation in Escherichia coli is 
not random but is driven in a manner that results in the leading 
and lagging strands being addressed to particular cellular destina- 
tions. These destinations are consistent with the known patterns 
of chromosome segregation”*. Our work demonstrates a new level 
of organization relating to the replication and segregation of the 
E. coli chromosome. 

Prokaryotic cells were long considered to be featureless until recent 
advances in imaging revealed an array of internal structures and sub- 
cellular organizations*. One such example is the nuclear architecture 
of E. coli, where the domains of the left and right chromosome arms 
occupy distinct cellular locations’. During replication, these domains 
are progressively segregated by an unknown mechanism that results in 
translational symmetry of the chromosome arms (known as repli- 
chores)**. Counter-intuitively, to obtain this translational symmetry 
there must be mirror symmetry in the segregation of the leading and 
lagging strands of the two replication forks (Fig. 1A). This leads to 
three possible situations. Either the two lagging strands of replication 
are positioned at mid-cell while the leading strands migrate to the cell 
poles; the two lagging strands migrate to the cell poles while the 
leading strands are positioned at mid-cell; or a random combination 
of these two segregation patterns occurs within the population. The 
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Figure 1| Distinguishing leading and lagging strands. A, Cartoon 
demonstrating how mirror symmetry of leading and lagging strands 
accounts for translational symmetry of sister chromosomes. Aa, Segregation 
of leading strands towards the cell poles and of lagging strands towards mid- 
cell. Ab, Segregation of lagging strands towards the cell poles and of leading 
strands towards mid-cell. B, SbcCD cleaves a DNA hairpin formed by the 
palindrome on the lagging strand of replication, and the broken 
chromosome is degraded in a recA’ mutant. 


specific hypothesis that leading strands segregate to cell poles has been 
proposed?® to explain the preponderance of highly expressed genes on 
the leading strands of the left (L) and right (R) replichores’. However, 
no evidence for or against this has yet been presented. 

To distinguish these three possibilities, we designed a construct 
(Supplementary Fig. 1) that would allow us specifically to visualize a 
locus on the right replichore (JacZ) that was replicated on the leading 
strand of the replication fork (Rjeaq). This is accomplished by indu- 
cing SbcCD-mediated palindrome cleavage* in a recA” mutant that 
has the palindrome flanked upstream by an array of tetO sites and 
downstream by an array of lacO sites. These arrays are visualized by 
fluorescence microscopy upon the binding of TetR-yellow fluor- 
escent protein (YFP) and LaclI-cyan fluorescent protein (CFP), 
respectively, and allow us to follow the cellular position of the 
chromosomal region containing the palindrome as it is replicated 
and segregated. This is visible as a twin YFP—CFP spot, demonstrat- 
ing the presence of DNA on both sides of the palindrome. Induction 
of SbcCD results in the specific cleavage of the palindrome that was 
replicated on the lagging strand* (Rig) by formation of a DNA 
hairpin’. In a recA” mutant the broken chromosome is degraded’” 
leaving behind the intact copy of the construct located on Rjeag 
(Fig. 1B). As long as the labelling method does not disrupt the known 
pattern of chromosome segregation, then the location of only one of 
the chromosome arms needs to be known for the rest to be inferred 
(Fig. 1A). 

The cellular position of this construct was followed during growth, 
and the number of segregated copies related to cell length. In the 
absence of induced double-strand breaks, 68.8% of exponentially 
growing cells longer than 1.50 ttm had two visibly segregated copies 
of the construct. In these cells, the loci were segregated such that one 
was located at mid-cell and the other at a cell pole (Fig. 2a). This 


Figure 2 | Visualization of construct. E. coli cell containing: a, two 
segregated copies of the construct; b, one located at mid-cell; and c, one 
located at a cell pole. The CFP signal is pseudocoloured green, YFP magenta. 
Overlapping CFP and YFP signals appear white. Visualization of the 
nucleoid in cells subjected to SbcCD-mediated palindrome cleavage for 
120 min by DAPI staining (DNA, blue) (d) and mounting on gelatin (DNA, 
white) (e). Scale bars, 1 [um (a—c) and 2 Lm (d, e). 
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localization pattern is consistent with published data on the pattern 
of chromosome segregation and localization in E. coli** and was 
indistinguishable from the recA* parental strains. Localization was 
also not independently affected by either induction of SbcCD or the 
presence of the palindrome. The induction of SbcCD expression in a 
palindrome-containing strain, however, caused a decrease in the 
number of cells longer than 1.50 |1m with two visibly segregated 
copies of the construct over time (Fig. 3a). This decrease was strongly 
associated with an increase in the number of cells with only one 
visible copy of the construct, suggesting that cleavage and degrada- 
tion of one of the replicated chromosomes had occurred. This was 
not observed in the control strain lacking the palindrome whereby 
80.2% + 3.1 (range, n=574) of cells had two visibly segregated 
copies of the construct at Ty and 81.2% + 1.8 (range, n = 796) after 
90min of SbcCD induction. Interestingly, visualization of the 
nucleoid by either DAPI staining or mounting cells on gelatin" 
revealed that even after extensive cleavage and degradation, the intact 
chromosome remains in one cell half (Fig. 2d, e). 

For cells longer than 1.50 {tm with only a single visible copy of the 
construct, the cellular position was classified as being closer to either 
the mid-cell (Fig. 2b) or the cell pole (Fig. 2c). In the absence of 
SbcCD-induced DNA double-strand breaks, the construct was found 
to be located in the two positions in approximately equal proportions 
(Fig. 3b). Notably, inducing SbcCD expression for 90 min in a strain 
containing a DNA palindrome resulted in the construct being located 
at the cell pole in most cells (89.2%). The slight bias towards being 
located at the cell pole (63.5%) at Ty is probably caused by leaky 
expression of SbcCD from the inducible promoter (Pgap)'” in min- 
imal growth medium. The simplest explanation of this is that SbcCD 
is specifically cleaving the palindrome located at mid-cell because it 
possessed a DNA hairpin that formed on the lagging strand of rep- 
lication. However, other formal possibilities must be considered. 

Time-lapse microscopy suggests that the chromosome is cleaved 
before visible segregation of the sister loci in most cells (data not 
shown). Therefore, to test the alternative hypothesis that the remain- 
ing copy of the palindrome migrated to the cell pole in response to the 
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Figure 3 | Construct degradation and localization. a, Number of visibly 

segregated copies of the construct (0, 1,2)inarecA -palindrome-containing 
strain as a function of time of SbcCD induction (error bars, range; n = 500 
for all time points). b, Location of construct in cells longer than 1.50 [um with 
a single segregated copy of the construct; error bars, range; pal, palindrome. 
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induced double-strand break as opposed to the normal process of 
chromosome segregation, strains were constructed that allowed 
expression of I-Scel from an inducible promoter with a unique 
I-Scel cleavage site in place of the palindrome. I-Scel is a site-specific 
endonuclease and therefore cannot distinguish between the cleavage 
sites located on the two sister chromosomes. Induction of cleavage by 
I-Scel for 90 min also resulted in an increase in the number of cells 
longer than 1.50 um with a single pair of foci (39.5% after 90 min of 
induction compared with 8.1% in a control strain lacking a cleavage 
site), again suggesting degradation of one of the sister chromosomes. 
The frequency was less than that observed with SbcCD-induced 
cleavage owing to a more dramatic increase in the cells with no visible 
foci, presumably because of instances of cleavage and degradation of 
all chromosomes. Unlike SbcCD-mediated palindrome cleavage, 
however, the position of the remaining copy of the construct was 
evenly distributed between mid-cell and cell pole positions (51.6% 
and 48.4%, respectively; Fig. 3b). This result argues against the possi- 
bility of the intact copy of the construct migrating to the cell pole in 
response to a DNA double-strand break. Furthermore, if cell division 
was inhibited by treatment with cephalexin to allow multiple rounds of 
replication and segregation within cells, induction of cleavage by 
SbcCD did not necessarily result in segregation to a cell pole (Fig. 4a, b). 

A final possibility is that SbcCD can only cleave a hairpin when it is 
located at mid-cell. Consistent with this argument, SbcCD-green- 
fluorescent protein (GFP) localizes at mid-cell in 54% of the popu- 
lation, with the remainder possessing two foci at the one-quarter and 
three-quarter positions of the cell'’. However, its co-localization with 
the replisomes makes it improbable that SbcCD is able to distinguish 
the two copies of the palindrome on the basis of their cellular loca- 
tion, considering that segregation of the two replicated sister loci is 
itself progressive from the replisomes. Also, the frequency of cleavage 
(more than 36% total DNA as measured on gels* and 52.8% of 
chromosomes as measured here by the frequency of chromosome 
degradation in cephalexin-induced filaments; Fig. 4c) is inconsistent 
with this possibility, which predicts a maximum of 25% cleavage per 
cell cycle if segregation of the leading and lagging strands is random. 

Our data therefore provide strong evidence that segregation of the 
two sister chromosomes in E. coli is not random and results in the 
placement of the two lagging strands at the cell quarters flanking the 
mid-cell (Fig. 1Aa). Non-random segregation of sister chromosomes 
has been observed previously, most notably in mammalian stem cells. 
This led to the proposal of the immortal strand hypothesis, which 
suggests that stem cells preferentially retain the grandmother DNA 
template strands to limit the accumulation of replication-induced 


c 
Strain Mean length Mean copies Copies per 
(um) of construct micrometre 
lacZ+ SbcCD* (Tj5)) 333 8.92 +0.44 6.3+0.3 0.72 + 0.02 (x) 
pal SbcCD* (T459) 358 14.22 40.57 4.5+0.2 0.34 + 0.01 (y) 


Errors indicate s.e.m. 


Figure 4 | Induction of SbcCD in cephalexin-induced filaments. 
Cephalexin-treated cell without (a) and with (b) the palindrome after 

120 min of SbcCD induction. The CFP signal is pseudocoloured green, YFP 
is magenta. Overlapping CFP and YFP signals appear white. Scale bars, 5 um. 
c, Analysis of the consequences of SbcCD expression in cephalexin-treated 
cells. Frequency of cleavage was calculated as 1 —y/x. 
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mutations. The immortal strand hypothesis is controversial’, 


however, and other explanations for the observed asymmetric 
segregation, such as epigenetic markers, have been proposed". 
These hypotheses require a mechanism of distinguishing sister chro- 
matids. It has recently been postulated that such a discrimination 
could be made on the basis of whether the centromeric region of the 
chromosomes were replicated on the leading or lagging strand’. 
Here we have shown that a similar distinction is made by E. coli. 

Our data do not provide a mechanistic understanding of this 
aspect of chromosome segregation. This is not surprising considering 
that the mechanism underlying chromosome segregation in E. coli is 
still unknown. The velocity at which the newly replicated origin 
regions are segregated indicates that chromosome segregation is an 
active process”, and a centromere-like sequence (migS) has been 
identified*’. Because later replicated regions of the chromosome do 
not migrate as rapidly, it has been proposed that segregation is a two- 
step process instigated by the active positioning of the origins”. 
Recent evidence suggests a role for the structural maintenance of 
chromosome protein MukB”’ in this initial step. It has also been 
suggested that the actin-like cytoskeleton protein MreB has a role in 
chromosome segregation”*”’, although this remains controversial”®. 

One possibility is that the pattern of segregation that we have 
observed is a consequence of the structure of the two replisomes at 
the replication factory”. In E. coli the leading and lagging strand 
polymerases are physically linked in a structure known as the repli- 
some. In the replisome, the lagging strand template is looped to allow 
the two polymerases to translocate in the same direction’. Although 
this loop is small, it may be enough to constrain the lagging strands 
physically to the replisome while allowing the leading strands to 
segregate to the cell poles. This would fit with the replication factory 
model, where the two replisomes are located at mid-cell. It is also 
plausible that it is advantageous for the cell to keep newly replicated 
lagging strands at mid-cell for further processing”, and that this local 
requirement is reflected in global organization. Recent evidence, 
however, argues against the existence of a replication factory in 
E. coli and implies that the replisomes simply track along the 
DNA”. If this is indeed the case, then it would suggest that newly 
replicated DNA follows a nucleoid organization that is established at 
the initial stages of the segregation process. 


METHODS SUMMARY 

Strains and plasmids. Strains were BW27784 derivatives allowing homogeneous 
expression from Pgap'*. Mutations were introduced by either P1 transduction or 
plasmid-mediated gene replacement’. Insertions and deletions were confirmed 
by PCR. recA” mutants were confirmed as ultraviolet light sensitive. Full details 
of strains and plasmids used can be found in Supplementary Information. 
Microscopy. Strains were grown at 37 °C in M9 minimal medium (with 0.2% 
glycerol) supplemented with 100ngml~! anhydrotetracycline to prevent 
operator-bound TetR-YFP from blocking replication’’. Unless otherwise stated, 
live cells were mounted on a bed of 1% agarose-H,O for viewing under the 
microscope. To induce expression from Pgap, cells were grown until the optical 
density at 600 nm (OD¢o0nm) = 1.0 in minimal medium supplemented with 
0.2% glucose, before diluting to OD¢o0nm = 0.2 in minimal medium supple- 
mented with 0.001% arabinose (To). To inhibit cell division, cells were grown for 
90 min in the presence of 10 pg ml! cephalexin before inducing SbcCD express- 
ion by diluting to OD¢o9 = 0.2 in minimal medium (with 0.001% arabinose) 
supplemented with cephalexin. SbcCD was induced for 120 min before image 
acquisition. To prevent cross-signal from the plasmid, nucleoids were visualized 
in strains DL3339 and DL3340. Nucleoids were visualized either by staining with 
DAPI (1 pg ml” ') or mounting live cells on 27% bovine gelatine-M9 minimal 
medium. 

Images were acquired at a resolution of 0.129 1m per pixel using a Zeiss 
Axiovert 200 fluorescence microscope equipped with a Photometrics cool- 
SNAP HQ CCD (charge-coupled device) camera and the acquisition software 
MetaMorph 6.3r2. CFP and YFP images were subject to three-dimensional adap- 
tive point-spread function (blind) deconvolution using Autodeblur and 
Autovisualize v9.3 before analysis. Cell lengths were measured using the ‘fibre 
length’ measurement of MetaMorph 6.3r2. Images were pseudocoloured using 
MetaMorph 6.312. The range indicates the result of two independent experiments. 
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Comprehensive mass-spectrometry-based proteome 
quantification of haploid versus diploid yeast 


Lyris M. F. de Godoy'*, Jesper V. Olsen’*, Jiirgen Cox'*, Michael L. Nielsen'*, Nina C. Hubner’, Florian Frohlich’, 


Tobias C. Walther? & Matthias Mann! 


Mass spectrometry is a powerful technology for the analysis of large 
numbers of endogenous proteins’”. However, the analytical chal- 
lenges associated with comprehensive identification and relative 
quantification of cellular proteomes have so far appeared to be insur- 
mountable’. Here, using advances in computational proteomics, 
instrument performance and sample preparation strategies, we com- 
pare protein levels of essentially all endogenous proteins in haploid 
yeast cells to their diploid counterparts. Our analysis spans more 
than four orders of magnitude in protein abundance with no dis- 
crimination against membrane or low level regulatory proteins. 
Stable-isotope labelling by amino acids in cell culture (SILAC) quan- 
tification*” was very accurate across the proteome, as demonstrated 
by one-to-one ratios of most yeast proteins. Key members of the 
pheromone pathway were specific to haploid yeast but others were 
unaltered, suggesting an efficient control mechanism of the mating 
response. Several retrotransposon-associated proteins were specific 
to haploid yeast. Gene ontology analysis pinpointed a significant 
change for cell wall components in agreement with geometrical con- 
siderations: diploid cells have twice the volume but not twice the 
surface area of haploid cells. Transcriptome levels agreed poorly with 
proteome changes overall. However, after filtering out low confid- 
ence microarray measurements, messenger RNA changes and SILAC 
ratios correlated very well for pheromone pathway components. 
Systems-wide, precise quantification directly at the protein level 
opens up new perspectives in post-genomics and systems biology. 

Yeast launched the genome era® and continues to be an informa- 
tive model system for genomic and post-genomics technologies. It 
has also been a fruitful testing ground for mass spectrometry (MS)- 
based proteomics’ '°. Repositories of yeast proteomics experiments 
contain about 4,000 proteins, albeit with varying confidence of iden- 
tification'’. Previously, we established that half of the yeast proteome 
could be detected with very high stringency by MS ina single experi- 
ment'*. The phosphoproteome of pheromone signalling has already 
been investigated by a SILAC experiment’. Until now, no strategies 
have been described to comprehensively identify, much less to com- 
prehensively quantify, two states of the yeast proteome against each 
other in a single experiment. 

To develop methods for proteome-wide quantification, we meta- 
bolically labelled haploid and diploid yeast with arginine and lysine 
SILAC. We investigated three strategies to achieve deep coverage of 
the yeast proteome: extensive fractionation of proteins; fractionation 
of digested peptides; and accumulating and sequencing distinct mass 
ranges of peptides (Fig. 1, Methods). The second strategy, combining 
in-solution digest with peptide separation by isoelectric focusing, 
yielded the most proteins (3,987) and is by far the simplest. 

Together, we identified 4,399 proteins with 99% certainty 
(Supplementary Table 4). Unambiguous identification only requires 


a few peptides per protein; however, on average we covered 32% of 
each protein sequence. 

Previously, expressed yeast genes were detected by a fused tandem 
affinity tag (TAP)"* or green fluorescent protein (GFP) tag’* in genome- 
wide experiments (Fig. 2a and Table 1) and our data overlaps 89% 
with each of these tagging approaches. In addition, MS identified 510 
proteins exclusively, including proteins in which the tag interferes with 
function, such as tail-anchored membrane proteins and proteins 
requiring carboxy-terminal modifications. As judged by MS, several 
hundred proteins previously reported at less than 50 copies per cell 
were part of different abundance classes over the whole dynamic range 
(Supplementary Fig. 5). Our data set is not biased against low-abun- 
dance proteins (Fig. 2b) or membrane proteins (30.9% of all proteins 
detected and 29.4% of the genome). Only 6% of yeast open reading 
frames (ORFs) were detected by both tagging methods but not by MS 
(Fig. 2a). This is less than the discrepancy between the tagging methods 
and includes 12 proteins that are inaccessible to MS due to a lack of 
appropriate tryptic or LysC cleavage sites, 33 proteins with overlapping 
genes (which we only counted as single identifications), 11 that have 
been removed from the database during the last three years, 8 dubious 
genes and 78 proteins for which no western blot quantification had been 
possible. Thus, of the accessible proteome, at most a few per cent of 
proteins are not detected. High-resolution data from the orbitrap 
instrument combined with efficient computational strategies led to very 
high peptide mass accuracy (average absolute mass deviation of 
590 p.p.b.) and to very high identification rates for mass spectrometric 
peptide fragmentation (>53% on SILAC peptide pairs, Fig. 1d and 
Methods), contributing to the identification of essentially the entire 
yeast proteome expressed in log-phase cells. 

Next, we determined the fold change of SILAC peptide pairs for 
relative proteome quantification between haploid and diploid yeast 
cells. In arginine and lysine double-labelled populations, we noticed 
that the proteomes were substantially different due to the presence of 
different sets of auxotrophic markers in the haploid and diploid 
strains (Supplementary Fig. 6). We therefore based our quantitative 
analysis on the lysine-labelled haploid S288C yeast strain and com- 
pared it to an isogenic diploid strain (Fig. 1b, cand Methods). A total 
of 1,788,451 SILAC peptide pairs were identified and quantified 
(median of 32 pairs per protein). Figure 3a and Supplementary 
Tables 6 and 7 show the ratios of all 4,033 quantified proteins and 
peptides from the lysine-labelling experiments. We achieved very 
high quantification accuracy, with 97.3% of the proteome changing 
less than 50% in abundance between haploid and diploid cells. 
Quantification after fractionation of digested peptides (Fig. 1b) 
showed excellent reproducibility (R=0.84 on average; Supple- 
mentary Fig. 7). One-hundred-and-ninety-six proteins changed 
significantly (P< 0.001), and we confirmed the regulation of 29 of 
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Figure 1| Three strategies for in-depth quantification of the yeast 
proteome by SILAC labelling and high-resolution mass spectrometry. 

a, Arginine and lysine SILAC labelling of haploid and diploid yeast. Arg10 is 
(°Cg,!°Na]L-arginine, Lys8 is [1°Cg/!°N2]1-lysine, and Arg0 and Lys0 are the 
normal, non-substituted amino acids. Extensive fractionation followed by 
tryptic digestion and one-dimensional gel electrophoresis as well as online 
LC-MS/MS on a hybrid linear ion trap—orbitrap instrument yielded, 
through triplicate measurements, 3,639 identified proteins at high 
stringency using the MaxQuant algorithms (J.C. and M.M., submitted; 
Supplementary Table 1). b, Lysine SILAC labelling of haploid and diploid 
yeast. Triplicate measurements of in-solution digestion with endoprotease 
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Figure 2 | Proteome coverage. a, Comparison of coverage of MS-based 
proteomics with GFP- and TAP-tagging methods’*’’. Numbers are the 
identified proteins by each method and, in parentheses, the number of 
dubious open reading frames (ORFs). b, Identified proteins per copy 
number bin for MS-based proteomics and the two tagging approaches. Copy 
numbers were estimated by correlation between summed peptide intensity 
per protein and the quantitative western blotting data‘* (Methods). 
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LysC followed by isoelectric focusing into 24 fractions and online LC-MS/ 
MS resulted in a proteome of 3,987 proteins (Supplementary Table 2). 
c, Same as b except that each isoelectric fraction is analysed five times with 
ion accumulation of a narrow m/z range for higher dynamic range. The 
signal-to-noise ratio and dynamic range improved by about a factor of five 
(Supplementary Fig. 1) and 3,779 proteins were identified (Supplementary 
Table 3). d, Typical contour plot of a single LC-MS/MS run. Peptide pairs 
eluting from the column (green) were automatically fragmented (blue 


crosses) and more than 60% of sequencing events on SILAC pairs resulted in 
successful identification (purple boxes). 


the top-regulated ones by western blot against either the fused TAP or 
GFP tag from the systematic collection’ (Supplementary Fig. 8). All 
ratios were in the same direction as that observed by MS-based pro- 
teomics. Kyoto Encyclopedia of Genes and Genomes (KEGG) path- 
way and Gene Ontology analysis (Supplementary Table 8) 
highlighted lysine biosynthesis as being upregulated in diploid cells 
(P=5 X10 °). This is due to heterozygosity for LYS2/lys2 and illus- 
trates the ability of proteome-wide quantification to pinpoint altered 
metabolic pathways (Supplementary Fig. 9a, c). 

Pheromone signalling is required for mating of haploid cells and is 
absent from diploid cells'®. The top ten haploid-specific proteins as 
determined by SILAC are components or transcriptional targets of 
pheromone signalling (Supplementary Table 9). Surprisingly, not all 
of its members are regulated equally (Fig. 3b). Key components of the 
signal transduction pathway and output factors were absent from 
diploid cells: the pheromone receptor (Ste2), the signal transducing 
G protein (consisting of Ste4, Stel8 and Gpal), the mitogen-acti- 
vated protein kinase (MAPK) scaffold protein Ste5, the MAPK Fus3 
and the output transcription factor Stel2. In contrast, the 


Table 1| Yeast ORFs identified by SILAC-based quantitative proteomics 


Number of ORFs TAP GFP nanoLC-MS 

Total yeast ORFs 6,608 4,251 4,154 4,399 
Characterized yeast 4,666 3,629 3,581 3,824 
ORFs 
Uncharacterized yeast 1,128 581 539 572 
ORFs 
Dubious yeast ORFs 814 26 (3%) 23 (3%) 3(<1%) 

ot present in ORF 15 11 0) 
database 


Comparative sequencing shows that 814 of the 6,608 yeast ORFs are never expressed (dubious 
ORFs, http://www. yeastgenome.org). Of these only six were identified in this experiment and 
hree were validated by SILAC-assisted de novo sequencing of several peptides (Supplementary 
Table 5 and Supplementary Figs 2-4). Two of the three validated ones were reclassified as 
genuine yeast genes during writing of this manuscript (YGLO41W-A and YPRI7OW-B). This 
eaves three potential false-positives (0.37% of 815) and suggests that our estimate of a false- 
positive identification rate of maximally 1% is conservative. 
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MAPKKKK Ste20, the MAPKKK Stell and the MAPKK Ste7 
remained unchanged. For some of these kinases, such as Ste7 or 
Ste11, this is readily explained because they fulfil another function 
in the osmolarity-sensing and filamentous growth pathway'’. For 
other proteins, such as the Far3/7/8/11 protein complex that med- 
iates one pathway of cell cycle arrest during the pheromone response, 
this is unexpected and might indicate that they have another function 
in haploid cells. This suggests another repressive function of Far3 
during the cell cycle. Consistently, its inactivation results in faster 
growth of haploid cells’®. 

The proteins encoded by retrotransposons Tyl and Ty2 are about 
ten times more abundant in haploid cells, consistent with regulation 
of specific Ty mRNAs by pheromone signalling in haploid cells and 
repression in diploid cells by the MATa/« transcription factor'’?”®. 
We also found the Ty1 transcription activator Tecl to be eight times 
more expressed in haploid cells. Little is known about the evolution- 
ary advantage of restricting retrotransposition to haploid cells, but 
because most wild-type cells are diploid, the repression of transposi- 
tion in these cells might be used to minimize the spread of det- 
rimental effects through the population. 

Cell wall components were statistically significantly reduced in 
diploid cells (P= 2.710 °; Supplementary Table 8). At first 
glance, this is surprising because diploid cells are on average twice 
as large as haploid cells and also have more cell wall. However, larger 
cells need less surface components in relation to ‘bulk’ proteins, and 
the observed downregulation (0.77) is very close to what would be 
expected from geometrical considerations: a sphere of double 
volume has 27° the surface and thus should have 27/°/2 = 0.79 the 
amount of surface proteins after normalization for the doubled 
volume. The list of differentially expressed factors also contains a 
number of uncharacterized genes, which can be mined for haploid- 
specific functions. 

A longstanding question in functional genomics is to what extent 
changes in mRNA levels lead to changes of the active agents in the 
cell, the proteins*'. Overall correlation of mRNA” and protein 
changes was poor (R= 0.24) and there were large populations of 
genes with mRNA but no protein change (Fig. 4a). However, after 
we filtered out low-level microarray signals (Supplementary Fig. 10), 
the correlation improved to 0.46 (Fig. 4b). Several of the remaining, 
discordant mRNA changes seem to be technical artefacts. For 
example, INO1, the protein level of which did not change, is the only 
representative of several co-regulated genes (for example, CHO] and 
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CHO2) that was found upregulated by microarray analysis. CTS1, 
which was downregulated according to microarray analysis, was 
upregulated when measured by SILAC and western blot. Several 
lysine biosynthesis pathway genes seem to be regulated at the protein 
but not the mRNA level (magenta in Fig. 4b). However, this is due to 
use of lysine auxotrophs in the MS but not the microarray experi- 
ments. Among genes only found upregulated by proteomics (blue in 
Fig. 4b), cell wall proteins were highly overrepresented 
(P=7.7 X 10 °, see Methods). This could be due to the microarray 
experiment not detecting slight expression changes for this class of 
proteins. Strongly regulated genes in both data sets were mainly 
components of the pheromone response. Here, correlation between 
mRNA and protein changes was high (R = 0.68; Fig. 4c). However, 
actual fold changes determined by microarrays deviated considerably 
from the values provided by the SILAC quantification 
(Supplementary Table 10). This is probably due to technical differ- 
ences (that is, microarray measurements are not strictly quantitative) 
combined with the fact that the level of mRNA change may not 
directly be translated into a change of protein level. 

In summary, a combination of SILAC labelling, high-resolution 
MS and sophisticated computational proteomics allows accurate 
quantitative analysis of an entire proteome. Among several tested 
strategies, in-solution digest of unfractionated cell lysate followed 
by simple isoelectric focusing of the peptides proved most powerful. 

Key advantages of MS-based proteomics are the ability to measure 
endogenous rather than tagged versions of proteins, which may have 
altered expression levels, and to quantify the entire proteome from 
one sample. Our comparison of the proteome with the transcriptome 
highlights several crucial points for systems-wide analysis. First, pro- 
teomics can directly measure small changes in the amounts of pro- 
teins, which might have important effects in the cell. Second, it shows 
that the relationship between mRNA and protein levels depends on 
the proteins investigated. This effect is likely to be even more notable 
in mammalian proteomes, which compared to yeast are more com- 
plex and subject to more post-transcriptional contro]. A mammalian 
cell is commonly thought to express 10,000 gene products, which 
would only be two to three times the number of genes expressed in 
yeast. Thus, we predict that essentially complete mammalian pro- 
teomes—with at least one representative protein per expressed 
gene—will be feasible with refined versions of our strategy’. The 
next challenge will then be proteome-wide identification of func- 
tionally important isoforms and modifications. 


Figure 3 | Quantitative differences 
between the haploid and diploid 
yeast proteome. a, Overall fold 
change for the yeast proteome. 

b, Members of the yeast pheromone 
response are colour-coded 
according to fold change. The 
diploid to haploid ratio as 
determined by SILAC is indicated 
for each protein. Figure is adapted 
from ref. 13. 
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Figure 4 | Proteome and transcriptome changes of haploid versus diploid 
yeast. a, The overall correlation between protein and mRNA changes is poor 
(R = 0.24). b, After filtering out low mRNA signals, the data from a correlates 
better (Supplementary Fig. 10). Red, significantly upregulated as mRNA and 
protein; blue, significantly upregulated as protein; green, significantly 


METHODS SUMMARY 


Yeast diploid and haploid strains were SILAC-labelled as described’ with 
[PCy 'SN)]L-lysine-and/or [PGs 'SN, L-arginine. The diploid yeast strain TWY 
809 was generated by crossing the wild-type BY4741 and BY4742. The haploid strain 
for lysine labelling was generated by sporulation of BY4743 and selection for the 
lysine auxotroph, MATa cells. Yeast cells were lysed, mixed 1:1, fractionated by SDS— 
PAGE and in-gel digested with trypsin as described previously'’. Alternatively, after 
mixing, proteins were digested in-solution by the endoproteinase LysC and the 
resulting peptide mixtures were fractionated by peptide isoelectric focusing. Each 
fraction was subsequently analysed by online liquid chromatography—tandem mass 
spectrometry (LC-MS/MS). All LC-MS/MS experiments were performed on an 
LTQ-Orbitrap (Thermo Fisher Scientific) mass spectrometer connected to an 
Agilent 1200 nanoflow HPLC system by means of a nanoelectrospray source 
(Proxeon Biosystems). MS full scans were acquired in the Orbitrap analyser using 
internal lock mass recalibration in real-time** whereas tandem mass spectra were 
simultaneously recorded in the linear ion trap. Peptides were identified from MS/ 
MS spectra by searching them against the yeast ORF database (Stanford University) 
using the Mascot search algorithm’? (http://www.matrixscience.com), and all 
SILAC pairs were quantified by MaxQuant (J.C. and M.M., submitted). For several 
of the top-regulated proteins, GFP- or TAP-tagged haploid and diploid strains were 
generated and the regulation was confirmed by western blot. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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changed as mRNA; and magenta, significantly downregulated as protein. 
c, Proteins involved in pheromone response (Fig. 3b) are co-regulated at 
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METHODS 
Generation and SILAC-labelling of haploid and diploid yeast strains. The 
Saccharomyces cerevisiae diploid strain YLG1 was generated by crossing the 
haploid YAL6B MATa strain’’ with one of its parental strains, Y15969 MATa 
(Euroscarf). The diploid yeast strain TWY 809 was generated by crossing the 
wild-type BY4741 and BY4742. The haploid strain for lysine labelling was gen- 
erated by sporulation of BY4743 and selection for the lysine auxotroph, MATa 
cells. The arginine and lysine double SILAC labelling was performed as 
described’, with small modifications. In brief, cells from the haploid YAL6B 
strain, which has a LYS] and ARG4 gene deletions and is therefore a double 
auxotroph for lysine and arginine, and the diploid YLG1 strain were grown in 
YNB liquid medium containing either 20 mg”! [1°C,/'°N3]1-lysine (Lys8) and 
5mgl! [1°C,,'°N,]L-arginine (Arg10; Isotec-Sigma) or 20 mg]! 1-lysine and 
5 mg! L-arginine for ten generations, until they reached log-phase (Dgoo 0.7). 
Lysis and protein fractionation strategy. Normal and heavy SILAC-labelled 
yeast cells were collected by centrifugation, resuspended in lysis buffer 
(150 mM potassium acetate, 2mM magnesium acetate, 1X protease inhibitor 
cocktail (Roche), and 20 mM HEPES, pH 7.4) and frozen in liquid N>. Haploid 
and diploid frozen cells were mixed 1:1 on the basis of protein amount (as 
determined by Bradford assay) and mechanically disrupted in a milling device 
(MM301 Ball Mill, Retsch), with 3 cycles of 3min at 10 Hz, intercalated by 
immersion in liquid N>. All further steps were performed at 4°C. The extract 
was allowed to thaw and centrifuged for 4 min at 1,000g. The pellet was collected, 
washed twice with lysis buffer, resuspended in PBS containing 2% SDS, incubated 
for 5 min at 65 °C and spun down to remove debris (fraction 1). The sample was 
centrifuged for 10 min at 20,000g and the resultant pellet washed twice with lysis 
buffer and resuspended in PBS containing 2% SDS (fraction 2). The supernatant 
was brought to 60% (NH4)SO,, incubated for 10 min under rotation to allow 
protein precipitation, centrifuged for 10min at 20,000g and the precipitated 
proteins resuspended in PBS containing 2% SDS (fraction 3). The concentration 
of (NH4)2SO, was raised to 80%, the sample processed as before, the precipitated 
proteins resuspended in PBS containing 2% SDS (fraction 4) and the remaining 
soluble proteins dialysed against PBS containing 2% SDS (fraction 5). 
In-solution digestion. Proteins extracted from lysine-labelled haploid and dip- 
loid yeast were reduced for 20 min at room temperature (24 °C) in 1 mM dithio- 
threitol and then alkylated for 15 min by 5.5 mM iodoacetamide (IAA) at room 
temperature in the dark. Endoproteinase LysC (Wako) was added 1:50 (w/w) and 
the lysates were digested overnight at room temperature (12h). Arginine- and 
lysine-labelled yeast proteins were digested with LysC in a similar manner, and the 
resulting peptide mixtures were diluted with Millipore water to achieve a final 
urea concentration below 2 M. Trypsin (modified sequencing grade, Promega) 
was added 1:50 (w/w) and digested overnight. Trypsin and LysC activity were 
quenched by acidification of the reaction mixtures with TFA to ~pH 2. 
Peptide isoelectric focusing. In-solution digested peptides (75 tig) were sepa- 
rated according to their isoelectric point using the Agilent 3100 OFFGEL frac- 
tionator (Agilent, G3100AA). The system was set up according to the manual of 
the High Res Kit, pH 3-10 (Agilent, 5188-6424), but strips were exchanged by 
24 cm Immobiline DryStrip, pH 3-10 (GE Healthcare, 17-6002-44), and ampho- 
lytes were substituted by IPG buffer, pH 3-10 (GE Helthcare, 17-6000-87), used 
1:50. Peptides were focused for 50 kilovolt hours (kVh) at a maximum current of 
50 pA, maximum voltage of 8,000 V and maximum power of 200 mW into 24 
fractions. Each peptide fraction was acidified by adding 3% acetonitrile, 1% 
trifluoroacetic acid and 0.5% acetic acid, then desalted and concentrated on a 
reversed-phase C18 StageTip”’. 
Gel electrophoresis and in-gel digestion. Each lysine- and arginine-labelled yeast 
protein fraction was boiled in 2 LDS buffer, separated by one-dimensional SDS— 
PAGE (4~-12% Novex mini-gel, Invitrogen) and visualized by colloidal Coomassie 
staining. The entire protein gel lanes were excised and cut into 20 slices each. Every 
gel slice was subjected to in-gel digestion with trypsin’. The resulting tryptic 
peptides were extracted by 30% acetonitrile in 3% TFA, reduced in a Speed Vac, 
and desalted and concentrated on a reversed-phase C18 StageTip”®. 
Mass spectrometric analysis. All MS experiments were performed on a nano- 
flow HPLC system (Agilent Technologies 1200) connected to a hybrid LTQ- 
orbitrap classic or XL (Thermo Fisher Scientific) equipped with a nanoelectros- 
pray ion source (Proxeon Biosystems) as described™ with a few modifications. In 
brief, the peptide mixtures were separated in a 15 cm analytical column (75 um 
inner diameter) in-house packed with 3-lym C18 beads (Reprosil-AQ Pur, Dr. 
Maisch) with a 2 h gradient from 5% to 40% acetonitrile in 0.5% acetic acid. The 
effluent from the HPLC was directly electrosprayed into the mass spectrometer. 
The MS instrument was operated in data-dependent mode to automatically 
switch between full-scan MS and MS/MS acquisition. Survey full-scan MS spectra 
(from m/z 300-2,000) were acquired in the orbitrap with resolution R = 60,000 at 
m/z 400 (after accumulation to a ‘target value’ of 1,000,000 in the linear ion trap). 
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The ten most intense peptide ions with charge states =2 were sequentially isolated 
to a target value of 5,000 and fragmented in the linear ion trap by collisionally 
induced dissociation. Fragment ion spectra were recorded with the LTQ detectors 
‘in parallel’ with the orbitrap full-scan detection. For all measurements with the 
orbitrap detector, alock-mass ion from ambient air (1m/z39 1.284286, 429.08875 or 
445.120025) was used for internal calibration as described”. 

For mass range experiments (similar to ‘gas-phase fractionation’) all samples 
were analysed using survey scan MS spectra in one of the following mass regions: 
m/z 300-500, m/z 450-650, m/z 600-900, m/z 850-1,250 and m/z 1,200-1,800. 
Resolution, lock mass option, ‘target value’ and number of intense peptide peaks 
selected for isolation were identical to full-scan analysis (see below), except for the 
mass range analysis m/z 1,200-1,800 where charge states =1 were allowed for 
isolation. All survey scans where acquired using injection waveforms, which 
applies a filter on the injection ions and thereby ejects all ions outside of the 
selected mass range. This ensures optimal dynamic range because the ion trap 
will only be filled with a population of ions belonging to the mass range of interest. 
Identification and quantification of peptides and proteins. The data analysis 
was performed with the MaxQuant software as described'’ supported by Mascot 
as the database search engine for peptide identifications. Peaks in MS scans were 
determined as three-dimensional hills in the mass-retention time plane. They 
were then assembled to isotope patterns and SILAC pairs by graph-theoretical 
methods. MS/MS peak lists were filtered to contain at most six peaks per 100 Da 
interval and searched by Mascot (Matrix Science) against a concatenated for- 
ward and reversed version of the yeast ORF database (Saccharomyces Genome 
Database SGDTM at Stanford University; http://www.yeastgenome.org). 
Protein sequences of common contaminants, for example, human keratins 
and proteases used, were added to the database. The initial mass tolerance in 
MS mode was set to 7 p.p.m. and MS/MS mass tolerance was 0.5 Da. Cysteine 
carbamidomethylation was searched as a fixed modification, whereas N-acetyl 
protein, N-pyroglutamine and oxidized methionine were searched as variable 
modifications. Labelled arginine and lysine were specified as fixed or variable 
modifications, depending on the previous knowledge about the parent ion. The 
resulting Mascot .dat files were loaded into the MaxQuant software’* together 
with the raw data for further analysis. SILAC peptide and protein quantification 
was performed automatically with MaxQuant using default settings for para- 
meters. Here, for each SILAC pair the ratio is determined by a robust regression 
model fitted to all isotopic peaks and all scans that the pair elutes in. SILAC 
protein ratios are determined as the median of all peptide ratios assigned to the 
protein. Absolute protein quantification was based on extracted ion chromato- 
grams of contained peptides. To minimize false identifications, all top-scoring 
peptide assignments made by Mascot were filtered based on previous knowledge 
of individual peptide mass error, SILAC state and the correct number of lysine 
and arginine residues specified by the mass difference observed in the full scan 
between the SILAC partners. Furthermore, peptide assignments were statistically 
evaluated in a Bayesian model on the basis of sequence length and Mascot score. 
We accepted peptides and proteins with a false discovery rate of less than 1%, 
estimated on the basis of the number of accepted reverse hits. 

Gene ontology and Pfam domain overrepresentation analysis. P values for the 
overrepresentation of gene ontology categories and protein domain content 
were based on a Wilcoxon—Mann—Whitney test for the presence—absence pattern 
of each category and the ratio significance as a continuous value. All P values 
below 0.01 are reported. To determine classes of proteins that show a high 
protein ratio but only low response on the transcript level, we defined a protein 
population with a protein ratio above two and a transcript ratio between one- 
half and two. We looked for enrichment of Gene Ontology terms in this class of 
proteins compared to the rest by calculating the P value according to the Fisher 
exact test. 

SILAC-assisted peptide-sequence-tag searching for ambiguous ORFs. 
Fragment ion intensities in spectra from ‘light’ and ‘heavy’ forms of a SILAC 
peptide pair are highly correlated. The only difference between their spectra is 
that C-terminal fragment ions (y-ions) are offset by 8.014 Da or other multiples 
of the difference between normal and heavy labelled amino acids. Extraction of 
y-ions is therefore straightforward and examples are shown in Supplementary 
Figs 2-4 for each of the three ORFs initially assumed not to be expressed. 
Searching these SILAC confirmed fragment ions (y-ions) in the yeast database 
as peptide-sequence tags** unambiguously verified identification of the ORFs. 
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Crystal structure of a stable dimer reveals the 
molecular basis of serpin polymerization 


Masayuki Yamasaki’, Wei Li’, Daniel J. D. Johnson’ & James A. Huntington’ 


Repeating intermolecular protein association by means of f-sheet 
expansion is the mechanism underlying a multitude of diseases 
including Alzheimer’s, Huntington’s and Parkinson’s and the 
prion encephalopathies'. A family of proteins, known as the ser- 
pins, also forms large stable multimers by ordered f-sheet linkages 
leading to intracellular accretion and disease’. These ‘serpin- 
opathies’ include early-onset dementia caused by mutations in neu- 
roserpin, liver cirrhosis and emphysema caused by mutations in 
@,-antitrypsin (a@,AT), and thrombosis caused by mutations in 
antithrombin’. Serpin structure and function are quite well under- 
stood, and the family has therefore become a model system for 
understanding the f-sheet expansion disorders collectively known 
as the conformational diseases*. To develop strategies to prevent 
and reverse these disorders, it is necessary to determine the struc- 
tural basis of the intermolecular linkage and of the pathogenic 
monomeric state. Here we report the crystallographic structure 
of a stable serpin dimer which reveals a domain swap of more than 
50 residues, including two long antiparallel B-strands inserting in 
the centre of the principal f-sheet of the neighbouring monomer. 
This structure explains the extreme stability of serpin polymers, the 
molecular basis of their rapid propagation, and provides critical 
new insights into the structural changes which initiate irreversible 
B-sheet expansion. 

Serpin polymerization was first described for the Z variant of x, AT?. 
Liver biopsies demonstrated an accumulation of «AT in the endo- 
plasmic reticulum of hepatocytes from patients homozygous for the 
mutation, and electron micrographs of the purified inclusions 
showed a distinct beads-on-a-string morphology, with occasional 
self-terminating circular necklaces. Wild-type «,AT can be induced 
to form a similar beads-on-a-string morphology and corresponding 
ladders on native polyacrylamide gel electrophoresis (PAGE) by 
incubation under mild-denaturing conditions. The apparent repeated 
and ordered intermolecular linkage resulted in the phenomenon 
being called ‘polymerization’, and multiple mutations in several ser- 
pins have since been identified (indicated in Fig. 1a), most of which 
cause disease by the accumulation of polymers within cells’. As a 
protein family, serpins are particularly susceptible to the formation 
of stable polymers owing to the metastability of their native fold 
(Fig. 1a) and the thermodynamically driven B-sheet expansion mech- 
anism required for their role as protease inhibitors*”. Approximately 
—32 kcal mol’ is released by the incorporation of the reactive centre 
loop (RCL) of a serpin into the centre of its main B-sheet’? (sheet A, 
Fig. la and b). On the basis of the observations that serpin polymers 
had comparable hyperstability and that exogenous RCL peptides were 
capable of blocking polymerization, it was suggested that the poly- 
meric linkage involved the incorporation of a portion of the RCL of 
one monomer into the B-sheet A of another’. 

We set out to determine the molecular basis of serpin polymeriza- 
tion by protein crystallography. However, the inherent propensity of 


polymers to continue to form higher order species after purification 
precluded concentration and crystallization. As mentioned pre- 
viously, serpin polymers can self-terminate to form necklace struc- 
tures, and a secreted self-terminating antithrombin dimer was 
recently observed in the blood plasma of patients carrying the P80S 
mutation’’. The dimer was composed of two hyperstable monomers 
and was incapable of continued polymerization, making this material 
amenable to crystallization. We were unable to obtain sufficient 
quantities of the dimeric mutant, so we produced and crystallized a 
self-terminating dimer from plasma-derived wild-type antithrombin 
(Methods and Supplementary Fig. 1), and solved its structure to 2.8 A 
resolution (Supplementary Table and Supplementary Methods). 

A single dimer was found in the asymmetric unit, composed of two 
latent-like (Fig. 1b) monomers arranged in a head-to-tail, back-to- 
back fashion (Fig. 1c). Close inspection of the electron density 
(Supplementary Fig. 2) revealed the presence of an extensive domain 
swap involving the insertion of strands 4 and 5 of one monomer into 
the B-sheet A of the other. The domain swap began immediately after 
helix I at residue 340 (Fig. 1c and Supplementary Fig. 2b), and con- 
tinued to residue 390 at the end of strand 4A. Thus, residue Gly 339 is 
the first hinge point for a perfect domain swap that extends from the 
carboxy terminus of helix I to strands 5 and 4 of B-sheet A, with 
residues 340-390 occupying equivalent positions on the dimer part- 
ner as found in hyperstable monomeric conformers of antithrombin 
(latent or RCL-cleaved). From the end of strand 4A, the return to the 
parent monomer is direct, with no intermolecular or native contacts 
until residue 407 at the beginning of strand 5 of f-sheet B 
(Supplementary Fig. 2c). As with the latent conformer (Fig. 1b), 
residues C-terminal to the RCL—which form strand 1C in native 
antithrombin (400—405)—have been extracted from sheet C to pro- 
vide sufficient length for return from strand 4A. The second hinge of 
the observed domain-swap is thus the loop between strands 1C and 
5B, at residue 406. This large-scale domain swap completes B-sheet A 
in an unexpected fashion, and reveals the molecular basis behind the 
hyperstability of the polymeric linkage. 

The structure of the self-terminating antithrombin dimer pre- 
sented here fully explains the molecular pathology of the P80S vari- 
ant. A question remains as to whether the observed intermolecular 
linkage involving strands 4A and 5A is specific for antithrombin or 
whether it is a general feature of serpin polymers. To investigate, we 
conducted the classic domain-swap experiment on the prototypical 
serpin «AT. Cysteines were substituted at adjacent positions on 
strands 5 and 6A (indicated in Fig. 1b), so that intermolecular 
disulphide bonds should only form after polymerization when these 
strands are donated by different protomers. Although the inhibitory 
activity was unaffected, the oxidized monomeric form of the variant 
was fully resistant to polymerization (Fig. 2a), consistent with the 
requirement of strand 5A release for polymer formation. However, 
under reducing conditions the variant polymerized in a manner 
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indistinguishable from wild type. Reoxidation of the polymeric 
material and subsequent denaturation by SDS showed the predicted 
intermolecular disulphide bonds (Fig. 2a), suggesting that the dual- 
strand domain swap (strands 4A/5A) observed for antithrombin also 
underlies the polymerization of «AT. 

For the practical reasons mentioned earlier, our crystallographic 
efforts were limited to the self-terminating dimer of antithrombin. 
Although this is sufficient to determine the intermolecular linkage, it 
was clear that the resulting structure would not represent the linear, 
self-propagating conformation, and that molecular modelling would 
be required. To create a dimer capable of continued polymerization, 
we linearized the self-terminating antithrombin dimer by breaking one 
intermolecular contact and placing the monomers in a head-to-tail 


Figure 1| Crystallographic structures of active, latent and self- 
terminating dimer of the serpin antithrombin. a, Active antithrombin is 
shown with the RCL on top (yellow) and the B-sheet A facing (red, with 
numbered strands). Either by proteolytic cleavage in the RCL or by 
extraction of strand 1C (s1C, orange), serpins incorporate the RCL into 
B-sheet A as strand 4A (s4A) resulting in a hyperstable six-stranded 
conformation. Polymerogenic mutations are shown (yellow is Z, magenta is 
Mmalton, blue is Siityama and Syracuse, red and cyan are His338Arg and 
Gly392Glu neuroserpin’, and green is P80S antithrombin’’). The loop 
connecting strand 6A to 5A is in cyan. b, The latent conformer of 
antithrombin is shown coloured as in a. Residues on strands 5 and 6A, which 
were mutated to Cys (see Fig. 2a), are indicated by green balls. c, The 
structure of the stable antithrombin dimer (monomer A coloured as in b, and 
monomer B is pale green). 
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configuration (Fig. 3a). The resulting dimer was capable of self- 
propagation to form polymers of infinite length with no steric clashes 
(Fig. 3b). However, this configuration required the unravelling of 
residues within and just C-terminal to helix I, to create a flexible 
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Figure 2 | Biochemical properties of serpin polymers and the M* state. 

a, The left panel is a native gel and the right panel is an SDS gel, and lanes are 
indicated as native (N), after polymerization (P), Cys-free control (WT), 
disulphide variant (S—S) and reduced variant (2SH) AT. P,.ox indicates 
chemical oxidation of the reduced variant polymers, and the positions of the 
oxidised (ox) and reduced (red) monomers are shown. Molecular mass 
standards are indicated. b, SDS-PAGE of AspN cleavage reaction for native 
(N) and polymerized (P) antithrombin (AT) and «AT. Several of the bands 
were sequenced, and those indicated by the arrows are shown in Fig. 3a. 

c, Native PAGE visualized by Coomassie staining (left panel) or fluorescence 
(right panel) assesses the reactivity of a Cys residue in the centre of helix I in 
RCL-cleaved (clvd), native (N) and polymerized (P) AT. Lanes with ‘+’ 
indicate that material was incubated with the fluorescent probe 
(didansylcysteine, DDC) just before electrophoresis. The positions of 
monomer (M), dimer (D) and trimer (T) are indicated. d, The proteolytic 
susceptibility of the native (N) and the M* state of antithrombin and «AT 
was assessed using AspN, V8 and LysC by SDS-PAGE, as indicated. Arrows 
indicate cleavage sites shown in Fig. 3c. 
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Figure 3 | Models of the serpin polymer and the M* state. a, A model of a 
linear serpin dimer (coloured as in Fig. 1c), with balls indicating AspN 
cleavage sites found in polymers of antithrombin (magenta) and «,AT 
(cyan). The yellow ball shows the position of the Leu303Cys substitution 
(Fig. 2c). b, A modelled pentamer was formed by the addition of protomers 
to both ends of the dimer. ¢, A model of the polymerogenic folding 
intermediate (M*) with the contiguous loop from the C terminus of strand 
6A to strand 1C in a random-coil conformation. New proteolytic sites in this 
region (magenta balls for antithrombin, cyan balls for «,AT and green balls 
for PAI-1; ref. 20) were detected for the M* state (Fig. 2d). 
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intermolecular linker from a once stably folded region. To test this, we 
subjected native and polymeric antithrombin and «AT to cleavage by 
the specific metalloprotease AspN. We found that polymers were con- 
sistently more susceptible to digestion by AspN than native controls 
(Fig. 2b), and amino-terminal sequencing revealed sites in and adja- 
cent to helix I for polymers of both antithrombin and «AT (plotted on 
Fig. 3a). The unravelling of helix I on polymerization of «,AT was 
assessed further by replacing a buried Leu in the centre of helix I with 
Cys (Leu303Cys position shown on Fig. 3a), and testing the reactivity 
of the sulphydryl to fluorescent probes in monomeric and polymeric 
forms. The sulphydryl was unreactive in native and RCL-cleaved states, 
consistent with a stable buried environment under helix I, but it was 
rapidly modified in the polymeric form (Fig. 2c), indicating the spe- 
cific polymerization-induced unfolding of helix I. 

Serpin polymerization in vitro is a two-step process, with an initial 
zero-order conversion to a polymerogenic monomer (denoted as 
M*) followed by a slower concentration-dependent intermolecular 
association step’*. The M* state can be populated by high temper- 
ature, low pH or chaotropic agents, and slowly refolds to the native 
conformation when returned to normal conditions!*"*. In contrast, 
serpin polymers do not require special conditions to propagate, and 
isolated dimers, trimers and so on, rapidly convert to higher order 
polymers under native conditions’. This property, recently referred 
to as ‘infectivity’, requires active donor and acceptor ends to persist 
when conditions are returned to those favouring the native state. Our 
structure provides a ready explanation for polymer infectivity. As 
illustrated in Fig. 3a, the top monomer of the linear dimer is com- 
pelled to remain in a state capable of donating strands 4 and 5A 
because its own B-sheet A is already complete, whereas the bottom 
monomer must remain in a quiescent acceptor conformation 
because its own strands 4 and 5 are inserted into the upper monomer. 
The model also explains why polymers are capable of incorporating 
exogenous peptides (possibly into the vacated strand 5A position of 
the bottom monomer), but not the RCLs (strand 4A) of native or 
peptide-complexed monomeric forms of the same serpin’” (because 
strand 5A is also required). 

To understand pathological serpin polymerization, why certain 
mutations predispose serpins to polymerize, and to develop stra- 
tegies to prevent or slow polymerization, it is necessary to define 
the structural features of the M* state. One important implication 
of our structure is that the principal feature of M* is the exposure of 
strand 5A. The M* state should thus resemble the monomeric com- 
ponent of the linear dimer (Fig. 3c), with the contiguous loop from 
the C terminus of strand 6A to the N terminus of strand 5B in an 
exposed and flexible conformation. This hypothesis is supported by 
the slower migration of M* on native PAGE“ and unfolding studies 
which showed that strand 1C and the loop between strand 5A and 
helix I are two of the first elements to unfold, and predictably, two of 
the last to fold’’. To investigate the predicted unfolding of this region 
in the M* state, we subjected antithrombin and «AT to limited 
proteolysis after brief exposure to conditions known to populate 
M* (Fig. 2d). The M* state was generally more susceptible to pro- 
teolysis than the native controls, and sequencing of the unique bands 
demonstrated cleavage sites in strand 5A and C-terminal to helix I for 
both serpins. Similar cleavage sites have been described for another 
serpin (PAI-1) after incubation with small hydrophobic com- 
pounds” (all sites plotted on Fig. 3c). The location of the pathological 
serpin mutations, on and underlying strand 5A, can now be under- 
stood in terms of a slowing of the final folding step from a polymero- 
genic intermediate with an exposed strand 5A to the native state with 
the strand incorporated in B-sheet A'*'*”’. 

Domain swapping is recognized as a general mechanism behind 
certain conformational diseases”, and although there are several 
examples in the literature of domain-swapped oligomers”, none 
represents the pathological linkage that results in a protein depos- 
ition disease. Our crystal structure is unique in this sense, and also 
differs in using two hinges and an internal motif, and in the hyperstability 
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of the intermolecular linkage. Our data suggests that the free-energy 
gained by completing B-sheet A is used to unfold helix I and the 
following coiled region. This unfolding event exposes a 30-residue 
linker that might explain why serpin polymers are hydrophobic”, 
prone to precipitation®”* and sensitive to seeding”. The linker is 
~50% hydrophobic and is predicted to have a high f-strand pro- 
pensity. Therefore, the linker could be responsible for the lateral 
association of linear serpin polymers into tangled aggregates, like 
those observed by electron microscopy from intact Collins bodies 
of neuroserpin’**. The precise mechanism of lateral association is 
unclear, but it is probable that the molecular pathology of serpin 
polymerization depends on such a secondary event for deposition 
and evasion of the cellular housekeeping machinery. Our findings 
have a profound effect on efforts to prevent, slow or reverse serpin 
polymerization. Strategies might target the M* state by increasing its 
rate of folding or by blocking the exposed strand 5A site. 
Alternatively, it may be possible to target the secondary lateral asso- 
ciation event with peptidic B-sheet breakers”. 


METHODS SUMMARY 


The stable antithrombin dimer was produced from plasma-derived native pro- 
tein and purified by gel filtration. Crystals were grown by vapour diffusion. Data 
processing and refinement statistics are given in the Supplementary Table. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Dimer preparation. The native «-glycoform of antithrombin was purified from 
outdated fresh-frozen human blood plasma, as previously described”. The stable 
dimer was produced by incubating 0.5 mg ml! antithrombin in 10 mM sodium 
acetate, pH 5.7, at 37°C for 16h. Supplementary Fig. 1a shows a typical time 
course under these conditions and the accumulation of the stable dimer. It 
migrated slightly faster than the polymerization-competent dimer on native 
PAGE, suggesting a more compact structure. Thrombin was unable to cleave 
the stable dimer, whereas precisely one monomer from the polymerization- 
competent dimer was cleaved in the RCL (Supplementary Fig. 1b). These data 
support the conclusion that the stable dimer was self-terminating by intermol- 
ecular RCL-f-sheet-A interactions. The stable dimer could also be formed by 
incubation at pH 7 and increasing the temperature to 53 °C, although the yields 
were low. The 16h incubation ensured that no native antithrombin remained 
and that the stable dimer was the only dimeric species. The reaction mixture was 
concentrated, buffer exchanged into 50mM Tris, pH 7.4, 0.3M NaCl, and 
loaded onto and eluted from a Superdex 200 HR10/30 size exclusion column 
(GE Healthcare). Fractions containing the dimer were pooled, concentrated to 
5mgml | in 10 mM Tris, pH 7.4, 50 mM NaCl for crystallization trials. 
Crystallization, data collection, molecular replacement and refinement. 
Initial hits were obtained using the robotic crystallization facility in the MRC- 
LMB”' from a screen of 384 conditions in 96-well sitting-drop vapour diffusion 
plates. Optimisation was conducted by hand in hanging-drop VDX plates, and 
diffraction quality crystals were eventually obtained in 20% PEG 3350, 0.2 M tri- 
potassium citrate. An initial, low-resolution data set was obtained from four 
crystals flash cooled in paratone-N on Diamond Light Source beamline 102. 
The data were processed using Mosflm, Scala and Truncate”, and were of suf- 
ficient quality (Rmerge = 0.246, mean (I/cI) = 7.0) and resolution (3.2 A) for 
molecular replacement and refinement. Molecular replacement solutions were 
found for two copies of antithrombin using native and latent antithrombin con- 
formers (1E04) with the program Phaser’; the latent conformer gave the best 
score. Examination of initial electron density (calculated from molecular replace- 
ment solutions using either native or latent antithrombin) showed that each 
monomer possessed a fully six-stranded B-sheet A with unbroken density for 
the turn between strands 5 and 4A. The structure was refined using heavy non- 
crystallographic symmetry (NCS) restraints for a large subset of the residues with 
the program CNS* (version 1.0) and rebuilt using XtalView® and Coot**. The 
domain swap was evident after the first round of refinement, and the linker 
regions were built into unequivocal positive difference density. We subsequently 
obtained a higher resolution data set from a single crystal on beamline 104 (wave- 
length of 0.919 A) after air dehydration and cryoprotection in paratone-N (100K). 
The Rgree set was taken from the initial data set and extended randomly to 2.8 A 
resolution, and the low-resolution model was subjected to rigid body refinement, 
with each core monomer constituting a rigid body. Heavy NCS restraints were 
used initially and subsequently relaxed, until in the final rounds NCS restraints 
were dropped altogether. Data processing and refinement statistics for the high- 
resolution data and structure are given in the Supplementary Table. The RCSB 
validation server placed 98.5% of the residues in the most favoured and addition- 
ally allowed Ramachandran regions, with no residues in the disallowed region. A 
similar calculation on the MolProbity server” classed 92.8% as favoured and 1.1% 
as outliers. Modelling an open AT dimer was conducted using the coordinates 
of RCL cleaved AT (PDB accession 1EZX) and programs XtalView*’ and CNS™. 
Figures were made using PyMol”™. 

a, AT Cys variants. The «AT used in this study was produced recombinantly 
from Escherichia coli as reported previously’. The template «AT was a double 
variant with a reactive centre mutation Met—Arg and the only Cys residue (232) 
mutated to Ala. Mutagenesis was conducted using the Quick Change kit and 
associated protocol (Strategene). This material was used for the limited proteo- 
lysis studies and was the background for mutations in which specific Cys residues 
were introduced. To test whether strand 5A is part of the domain swap for «AT, 
we introduced Cys residues on adjacent, externally exposed positions on strands 
5A and 6A (Thr339Cys and Ser292Cys, respectively). The disulphide bond spon- 
taneously formed during purification of the variant, as indicated by faster mobi- 
lity on SDS-PAGE. The inhibitory activity of the variant was indistinguishable 
from wild type, indicating that B-sheet A expansion was unperturbed by the 
intramolecular disulphide bond between strands 5 and 6A. After reduction of the 
disulphide bond (15 min incubation in 2mM dithiothreitol (DTT) at 37 °C), 
polymerization was induced by incubation in 0.75 M guanidine-HCl for 8h at 
37°C in 50mM sodium phosphate buffer, pH 7.0. The reaction mixture was 
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subsequently dialysed against Tris-buffered saline overnight at 4°C, and the 
disulphide bond was re-formed by adding tetramethylazodicarboamide to 
0.5mM for 2h at room temperature. Wild type and the oxidized variant were 
treated identically, but without the initial incubation with DTT. Samples were 
run on 8% native and 10% SDS-PAGE. To test the unfolding of helix I in the 
a,AT polymers, Leu 303 was substituted by Cys, and the accessibility of the 
sulphydryl was probed for native, RCL-cleaved and polymers by reacting with 
DDC (Sigma). The inhibitory activity of the Lys303Cys variant was indistin- 
guishable from wild type, but was mildly polymerogenic. The three forms of the 
variant were incubated with 0.8 mM DDC for 5 min at room temperature in the 
dark, just before loading on native PAGE. Similar results were obtained using the 
probe 5-iodoacetomidofluorescein (data not shown). 

Limited proteolysis. The exposure of distinct regions on polymerization of 
antithrombin and «AT was tested by limited proteolysis using the highly spe- 
cific metalloprotease, AspN, which cleaves peptide bonds N-terminal to available 
Asp residues. Polymers of antithrombin were prepared by incubating 
0.5mgml_! antithrombin in 10mM sodium phosphate buffer (pH 6.5) at 
53°C for 1 to 2h. Native and polymeric forms were then subjected to AspN 
digestion (1:65 ratio by weight) overnight at 4°C. Reactions were stopped by 
adding SDS-loading buffer (including DTT) and boiling for 5 min. The samples 
were then run on a 12% Bis-Tris gel in MES buffer (Invitrogen). To identify the 
bands of interest, identical samples were run on a separate gel and blotted onto a 
PVDF membrane for N-terminal sequencing. All N-terminal sequencing was 
conducted by the Protein and Nucleic Acid Chemistry Facility, Department of 
Biochemistry, University of Cambridge. Polymers of recombinant «,AT were 
generated by incubation of 0.025 mg ml! «,AT in 50mM sodium phosphate 
buffer, pH 7.0, at 53 °C for 3 h. Native and polymeric forms were subject to AspN 
digestion (1:50 ratio by weight) overnight at 4°C. Samples were then precipi- 
tated with 10% TCA and washed with acetone. Pellets were resuspended in 
50 mM Tris, pH 8.4, and subjected to SDS-PAGE and N-terminal sequencing 
as above. Identical results were obtained when polymers of «AT were generated 
with 0.75 M guanidine-HCl at 37 °C. The M* form of antithrombin was gener- 
ated by incubation of 0.25mgml ! antithrombin in 10mM sodium citrate 
buffer (pH 4.5) at room temperature for 4h, followed by rapid reformulation 
into 50 mM Tris, 100 mM NaCl, pH 7.4, and the addition of proteases AspN, V8 
or LysC (1:50-1:100 ratio by weight). After an overnight incubation at 4 °C, the 
reactions were stopped by the addition of SDS-loading buffer and boiling. The 
samples were then subjected to SDS-PAGE and N-terminal sequencing as above. 
The M* form of «AT was generated by incubating 0.1 mgml! of «AT in 
50 mM sodium phosphate buffer, pH 7.0, containing 0.8 M guanidine-HCl at 
37°C for 2h. Native and M* (after reformulation as for antithrombin) «,AT 
were subject to AspN and LysC digestion as above (V8 cleaves «AT in the RCL 
and was therefore not used). Samples were then precipitated with 10% TCA and 
washed with acetone. Pellets were resuspended in 50 mM Tris, pH 8.4, and sub- 
jected to SDS-PAGE and N-terminal sequencing as above. Similar cleavage 
results were obtained for 7,AT when M* was generated by short incubations 
either at low pH (pH 4.5 and 37 °C) or at high temperature (pH 7 and 49 °C). 
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ROS3 is an RNA-binding protein required for DNA 
demethylation in Arabidopsis 


Xianwu Zheng’, Olga Pontes’, Jianhua Zhu°, Daisuke Miki’, Fei Zhang’*, Wen-Xue Li’, Kei lida’, Avnish Kapoor’, 


Craig S. Pikaard* & Jian-Kang Zhu’ 


DNA methylation is an important epigenetic mark for transcrip- 
tional gene silencing (TGS) in diverse organisms’ *. Recent studies 
suggest that the methylation status of a number of genes is dynami- 
cally regulated by methylation and demethylation””®. In Arabi- 
dopsis, active DNA demethylation is mediated by the ROS1 (repres- 
sor of silencing 1) subfamily of 5-methylcytosine DNA glycosylases 
through a base excision repair pathway*'*”’. These demethylases 
have critical roles in erasing DNA methylation and preventing TGS 
of target genes”*'°. However, it is not known how the demethylases 
are targeted to specific sequences. Here we report the identification 
of ROS3, an essential regulator of DNA demethylation that contains 
an RNA recognition motif. Analysis of ros3 mutants and ros1 ros3 
double mutants suggests that ROS3 acts in the same genetic pathway 
as ROS1 to prevent DNA hypermethylation and TGS. Gel mobility 
shift assays and analysis of ROS3 immunoprecipitate from plant 
extracts shows that ROS3 binds to small RNAs in vitro and in vivo. 
Immunostaining shows that ROS3 and ROS] proteins co-localize in 
discrete foci dispersed throughout the nucleus. These results demon- 
strate a critical role for ROS3 in preventing DNA hypermethylation 
and suggest that DNA demethylation by ROS1 may be guided by 
RNAs bound to ROS3. 

We developed a sensitive assay system in Arabidopsis to dissect 
active DNA demethylation genetically'®'*. The system consists of 
the RD29A-LUC transgene (firefly luciferase reporter driven by the 
stress-responsive RD29A promoter) and the non-allelic endogenous 
RD29A gene. The RD29A promoter is subjected to continuous short 
interfering RNA (siRNA)-directed DNA methylation such that active 
DNA demethylation is required to keep the RD29A and RD29A-LUC 
genes transcriptionally active. In ros] mutants, the RD29A promoter 
for both the transgene and endogenous gene becomes hypermethy- 
lated and both genes are silenced’. In addition, the 35S-NPTII trans- 
gene linked to RD29A-LUC is also silenced such that ros] mutant 
plants are sensitive to kanamycin. We isolated the ros3 mutant from a 
T-DNA mutagenized population”. Like ros] mutations, ros3 causes a 
substantial reduction in bioluminescence emission (Fig. la and 
Supplementary Fig. la, b) as well as sensitivity to kanamycin 
(Fig. 1b). Genetic analysis indicated that the ros3 mutation is reces- 
sive and affects a nuclear gene (data not shown). 

Northern blot (Fig. 1c) and nuclear run-on (Fig. 1d) assays showed 
that the RD29A-LUC and 35S-NPTII transgenes and the endogenous 
RD29A gene are repressed transcriptionally in ros3 plants. Compared 
with the wild type, both the endogenous (Fig. 2a) and transgene 
(Fig. 2b) RD29A promoters in ros3 mutants are substantially more 
heavily methylated at CpG, CpNpG (where N is A, T or C) and 
CpNpN sites (Supplementary Table 1). Southern blot analysis with 
methylation-sensitive restriction enzymes also indicated DNA 


hypermethylation in the RD29A promoters in the mutant 
(Supplementary Fig. 2a, b). Treatment with the cytosine methylation 
inhibitor 5-aza-2’-deoxycytidine increased RD29A-LUC expression in 
the ros3 mutant to wild-type levels (Supplementary Fig. 3). These 
results suggest that DNA hypermethylation is responsible for the 
TGS in ros3 mutant plants. 

The nrpdla-1 mutation in the largest subunit of RNA polymerase 
IVa blocks the accumulation of 24-nucleotide siRNAs corresponding 
to the RD29A promoter (data not shown). Analysis of nrpdlaros3 
double mutants showed that the nrpdla mutation causes a significant 
increase in RD29A-LUC expression (Fig. 2c, d) and a substantial 
decrease in CpG, CpNpG and CpNpN methylation at the transgene 
RD29A promoter (Fig. 2e) in the ros3 mutant background. The 
release of TGS in ros3 by nrpdla suggests that the siRNAs targeting 
the RD29A promoter are the initial trigger of TGS in the ros3 mutant. 
The levels of siRNAs targeting the RD29A promoter do not differ 
substantially between ros3 and the wild type (Supplementary Fig. 1c). 

To analyse the genetic interaction between ros3 and ros1, we crossed 
rosI-1 with ros3 and generated a ros! ros3 double mutant. Analysis of 
cytosine methylation levels at the endogenous RD29A as well as trans- 
genic RD29A promoters showed that the levels of CpG, CpNpG and 
asymmetric CpNpN methylation in ros! ros3 are similar to those in the 
ros1 single mutant (Fig. 2a, b and Supplementary Table 1). DNA methy- 
lation at the intergenic 80, Atlg23740389-0769, At4g5390602-0993 
and At2g1699280-9430 loci is higher in ros! and ros3 single mutants 
compared to the wild type, and the double mutant shows a methylation 
pattern similar to ros] (Supplementary Fig. 4a—d and Supplementary 
Table 2). Therefore, the effects of the ros] and ros3 mutations on the 
methylation levels are not additive, indicating that ROS3 and ROS1 
function in the same genetic pathway to demethylate DNA. 

The second and third pairs of true leaves in ros3 are narrower and 
slightly lobed compared to those in the wild type (Supplementary Fig. 
5a). On the basis of the leaf morphology and kanamycin sensitivity 
phenotypes, we mapped the ros3 mutation to a genomic region on 
chromosome V (Supplementary Fig. 5b). Sequencing of all annotated 
genes in this region revealed a deletion of 260 base pairs (bp) in ros3 
that includes the first intron and part of the second exon of 
AT5G58130 (Supplementary Figs 5c and 6). AT5G58130 encodes a 
protein with an amino-terminal RNA recognition motif (RRM), a 
central COG5406 domain found in the nucleosome binding factor 
SPN, and a carboxy-terminal sequence with a secondary structure 
similar to those of RRMs (Supplementary Figs 5c, 7 and 8). A wild- 
type genomic fragment of AT5G58130 was found to complement the 
luminescence, leaf morphology (Supplementary Fig. 5d, e) and DNA 
hypermethylation (Fig. 2a, b) phenotypes of the ros3 mutant, con- 
firming that AT5G58130 is ROS3. 
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Expression of a translational fusion of yellow fluorescence protein 
(YFP) at the N terminus of ROS3 in onion epidermal cells or 
Arabidopsis showed a clear nuclear localization of the fusion protein 
(Supplementary Fig. 9a). Close inspection revealed that the fusion 
protein is concentrated in discrete foci in the nucleoplasm and is also 
present in the nucleolus (Supplementary Fig. 9b, c). Analysis of GUS 
reporter gene expression driven by the ROS3 promoter suggested 
that ROS3 is ubiquitously expressed in plant tissues (Supple- 
mentary Fig. 9d). 

Because ROS3 contains an RRM domain, we tested whether it may 
bind to RNA. We expressed recombinant ROS3 or its truncated forms 
with an N-terminal histidine tag (Fig. 3a, b). Electrophoretic mobility- 
shift assays with single-stranded and double-stranded RNAs of differ- 
ent sizes (Supplementary Table 3) were carried out. ROS3 was able to 
bind single-stranded RNA of sequence b (corresponding to the 
RD29A promoter) that was either 21-, 24- or 26-nucleotides, but it 
did not bind 21-, 24- or 26-nucleotide single-stranded RNA of 


c Control Cold ABA NaCl 
WT ros3 WT ros3 WT ros3 WT ros3 
LUC . 
RD29A - - _ —_ —_ 
sad - 


NPT — 


= i a an BP GD 
ON bE BCA ACASASAS 


NPTII_ COR15A LUC 


-=-@< 


COR15A 


Figure 1| The ros3 mutation causes transcriptional gene silencing. 

a, Stress-induced expression of the RD29A-LUC transgene in wild type (WT) 
and ros3 mutant plants after treatment with 300 mM NaCl for 5h. b, Like 
ros1, the ros3 mutant plants are sensitive to kanamycin. c, Northern blots 
showing that ros3 reduces the transcript levels of LUC, NPTI and 
endogenous RD29A, but not of the control, CORI5A. Plants were either 
untreated (control) or treated with cold (4 °C) for 24h, 100 uM abscisic acid 
(ABA) for 3h, or 300 mM NaCl for 5h. d, Nuclear run-on assay showing the 
pre-mRNA levels of LUC, NPTII and RD29A genes in wild-type and ros3. 
CORI15A and CBF3 were used as controls. 
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sequence a (Fig. 3c). It also did not bind a 40-nucleotide single- 
stranded RNA (Fig. 3c). There was no binding (data not shown) to 
any double-stranded RNA or any other single-stranded RNA tested 
(Supplementary Table 3). Removal of the N-terminal RRM domain 
and the C-terminal region (Fig. 3c, lanes 4, 12 and 14; Fig. 3d, lane 4), 
or of the N-terminal RRM domain alone (Fig. 3d, lane 2), abolished 
binding to RNA of sequence b. However, removal of the C-terminal 
region did not substantially affect RNA binding (Fig. 3d, lane 3). The 
binding to the 24-nucleotide RNA (sequence b) is dependent on ROS3 
protein concentration, and is competed by excess unlabelled small 
RNA of the same sequence (Fig. 3e). These results show that ROS3 
has the capacity to bind specific small RNAs and that the RRM domain 
is required for this binding. Although ROS3 could bind 21-, 24- and 
26-nucleotide single-stranded RNAs of sequence b in vitro, RNA blot 
analysis showed that only a ~24-nucleotide small RNA of sequence b 
is present in Arabidopsis (Supplementary Fig. 10a). 

To determine whether ROS3 may bind small RNAs in vivo, ROS3 
protein was immunoprecipitated from Arabidopsis extracts using 
anti-ROS3 antisera. The immunoprecipitate was fractionated on a 
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Figure 2 | DNA hypermethylation in ros3 and suppression of ros3 by 
nrpdia. a, b, Bisulphite sequencing analysis of promoter methylation status 
of the endogenous RD29A (a) and RD29A-LUC transgene (b) in wild type, 
ros1, ros3, ros1 ros3, ros1 complemented with wild-type ROS1 transgene, and 
ros3 complemented with wild-type ROS3 transgene. c, Suppression of ros3 
by the nrpdla-1 mutation. Seedlings of wild type, ros3 and nrpd1a ros3 
double mutants grown in MS medium for 3 weeks were transferred to a filter 
paper soaked with 300 mM NaCl for 5h. Left, picture of the seedlings; right, 
luminescence image. d, Quantification of luminescence in c. Numbers on the 
y axis represent counts per seedling. Error bars represent standard deviation 
(n = 20). e, DNA methylation status at the transgene RD29A promoter in 
ros3 and nrpd1aros3 plants. 
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polyacrylamide gel and putative ROS3-bound small RNAs of ~15— 
30 nucleotides were extracted from the gel, cloned and sequenced. 
Out of 288 clones sequenced, 140 had inserts of 10-30 nucleotides 
(Supplementary Table 4). Notably, the sequences are very rich in Gs. 
Two 25-nucleotide sequences—RNA sequence c, 5’-GGGAG 
UCCGGAGACGUCGGCGGGGG-3’ and RNA sequence d, 
5'-UCGGGAGGGAAGCGGAUGGGGGCCG-3'’—were chosen for 
further analysis. These small RNAs correspond to a genomic region 
(intergenic between At3G41979 and At3TE58310) that appears to 
have higher DNA methylation in the rosl dml2 dml3 demethylase 
triple mutant than in Columbia wild-type control plants” (http:// 
neomorph.salk.edu/epigenome.html). Both RNA sequence c and 
RNA sequence d could be detected as ~25-nucleotide small RNAs 
in plants by northern blot analysis (Supplementary Fig. 10b, c). ROS3 
protein is able to bind RNA sequence c and RNA sequence d in vitro 
(Fig. 3f). Bisulphite sequencing analysis indicated higher levels of CG, 
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Figure 3 | ROS3 binds small RNAs. a, Diagram of ROS3 and its truncated 
mutant forms. b, Coomassie-stained SDS-PAGE gel showing the 
recombinant proteins used for RNA binding assays. ¢, ROS3 but not 
ROS3ANC binds 21-, 24- and 26-nucleotide single-stranded RNA of 
sequence b. Lanes 1, 3, 5, 7, 9, 11 and 13, ROS3; lanes 2, 4, 6, 8, 10, 12 and 14, 
ROS3ANC. d, ROS3AC but not ROS3AN or ROS3ANC binds 24-nucleotide 
single-stranded RNA of sequence b. Lane 1, ROS3; lane 2, ROS3AN; lane 3, 
ROS3AC; lane 4, ROS3ANC. e, Protein-concentration-dependent binding 
to 24-nucletoide single-stranded RNA of sequence b and competition by 
unlabelled small RNA. Lanes 1-4, increased binding to 24-nucleotide 
(sequence b) small RNA by increasing ROS3 protein concentration; lanes 
5-9, competition by increasing amounts of cold 24-nucleotide RNA 
(sequence b). f, ROS3 binds 25-nucleotide RNA sequences c and d in vitro. 
RNAs of sequences b and a are used as controls. Lanes 1, 3, 5 and 7, ROS3; 
lanes 2, 4, 6 and 8, ROS3AN. 
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CNG and CNN methylation in ros3 compared to the wild type in the 
genomic region corresponding to the small RNAs (Supplementary 
Fig. 11 and Supplementary Table 2). These results indicate that ROS3 
binds to small RNAs in vivo, and the small RNAs may direct demethy- 
lation of target sequences. Further studies may reveal whether ROS3 
is also capable of binding larger RNAs and what specific sequence 
features it recognizes. 

We immunolocalized ROS3 and ROS1 proteins using antibodies 
recognizing the native proteins or epitope tags fused to ROS1 and 
ROS3 recombinant proteins. Immunolocalization in Arabidopsis leaf 
nuclei at interphase revealed that in wild-type plants ROS3 is loca- 
lized in the nucleoplasm as well as the nucleolus (Fig. 4a—c). The 
ROS3 signals appear as scattered speckle-like structures, ranging in 
number from 5 to more than 12 per nucleus (Fig. 4 and 
Supplementary Table 5). This pattern is consistent with the YFP— 
ROS3 result (Supplementary Fig. 9b, c). ROS1 was similarly found to 
be dispersed throughout the nucleoplasm and nucleolus, although 
the immunolocalization signals tended to appear more diffuse and 
smaller than ROS3 foci (Fig. 4a, b and Supplementary Table 6). In 
ros3 and ros1 mutants, no significant signals could be obtained, which 
indicates that the antibodies are specific for ROS3 and ROS1, respect- 
ively (Fig. 4a). Moreover, epitope-tagged ROS3 and ROS1 proteins 
displayed interphase localization patterns very similar to the patterns 
observed for the wild-type proteins (Supplementary Fig. 12). 
Simultaneous immunolocalization of c-Myc-tagged ROS1 and 
native ROS3 revealed substantial co-localization of ROS1 and 
ROS3 (Fig. 4b and Supplementary Table 7). 
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Figure 4 | Co-localization of ROS3 with ROS1 in the nucleus of Arabidopsis 


mesophyll cells and assay of ROS7 and ROS3 mRNA levels. a, Localization 
of ROS3 and ROS] by their respective antibodies. C24, Arabidopsis thaliana 
C24 ecotype carrying the RD29A-LUC transgene (wild type). b, Dual 
immunolocalization of ROS3 and ROS1 with use of anti-ROS3 and anti- 
Myc. ¢, ROS1 localization in ros3 mutant and ROS3 localization in ros1 
mutant showing their interdependence for appropriate localization. Scale 
bars correspond to 5 jum. d, Quantitative RT-PCR assay of relative ROS1 
and ROS3 transcript levels in the various genotypes. Error bars represent 
standard deviation (n = 3). 
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ROS3 and ROS1 immunostaining was also carried out in ros! and 
ros3 mutants, respectively. ROS1 localization was found to be severely 
disrupted in the ros3 mutant background (Fig. 4c). The ROS3 inter- 
phase localization pattern was less changed in the ros] mutant, 
although ROS3 nucleolar localization was reduced in the ros] mutant 
(Fig. 4c). These results suggest that ROS1 and ROS3 are interdepend- 
ent for their nuclear, and especially nucleolar, co-localization. 

Because ROS] transcript levels are reduced in mutants defective in 
DNA methylation’, we tested whether ros3 may prevent demethylation 
by reducing ROS1 expression. ROS1 messenger RNA level was reduced 
in ago6-1 (ref. 17) mutants as expected, but was increased in ros3 
mutants (Fig. 4d). Similarly, ROS3 mRNA level was decreased in 
ago6-1 mutants but increased in ros] mutants (Fig. 4d). The results 
suggest that ros3 does not reduce ROS] mRNA levels and that the 
DNA methylation status at some critical sites may be sensed to regulate 
the transcript levels of active DNA demethylation factors such as ROS1 
and ROS3. 

Our results suggest that ROS3 functions in the same DNA 
demethylation pathway with ROS1. ROS3 is capable of binding small 
RNAs in vitro and in vivo, suggesting that small RNAs and/or larger 
RNAs bound to ROS3 may guide sequence-specific DNA demethyla- 
tion by ROS1. In plants, 24-nucleotide siRNAs bound to argonaute 4 
(AGO4) and AGO6 can direct DNA methylation and TGS**!771. 
Recent studies suggest that piwi-interacting small RNAs function 
to direct de novo DNA methylation and silencing of retrotransposons 
in mammalian germ cells”. In mammalian cells, promoter-directed 
siRNAs can also induce TGS”***. Notably, some promoter-directed 
small RNAs were recently found to cause gene activation in human 
cells***°. Our work showing that a protein required for DNA 
demethylation is capable of binding to small RNAs contributes to 
understanding the mechanism of DNA demethylation and adds to 
the expanding role of small RNAs in gene regulation”’”’””. It also sets 
the stage for future work to determine the relationship between small 
RNAs directing DNA methylation and TGS and those directing DNA 
demethylation and gene activation. Although our data suggest that 
ROS3 with its bound RNAs may guide sequence-specific demethyla- 
tion, it is also possible that ROS3 may suppress DNA methylation by 
sequestering RNAs that guide DNA methylation, or by promoting 
ROS! protein stability or nuclear localization. 


METHODS SUMMARY 


The ros3 mutant was isolated from a mutagenized population of RD29A-LUC 
plants by luminescence imaging’’. Gene silencing phenotypes of the mutant were 
examined by RNA blot analysis and nuclear run-on assays. DNA methylation 
levels were assessed by bisulphite sequencing of genomic DNA or Southern blot 
analysis. The ros3 mutant was mapped and cloned following a previously pub- 
lished strategy’. The subcellular localization of ROS3 protein was determined by 
confocal imaging of cells or plants expressing YFP—ROS3 (driven by the 35S 
promoter). The subnuclear co-localization of ROS3 and ROS1 proteins was 
examined by immunostaining using antibodies against the wild-type protein 
or epitope tags. Bacterially expressed recombinant ROS3 protein and its trun- 
cated forms were purified and used for gel mobility shift assays to examine RNA 
binding in vitro. To test in vivo RNA binding by ROS3, ROS3 protein was 
immunoprecipitated from plant extracts. Small RNAs from the immunopreci- 
pitate were identified by cloning and sequencing. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Plant materials, mutant isolation and gene cloning. Unless specified otherwise, 
wild type in this study refers to the Arabidopsis thaliana C24 ecotype carrying the 
RD29A-LUC transgene. A T-DNA insertion population was obtained by T-DNA 
mutagenesis in wild type'°. T2 seedlings were screened for mutants on the basis of 
luminescence emission. Luminescence imaging was carried out as described”*. 
RNA analysis. Total RNA was extracted from 15-day-old seedlings by using 
guanidine thiocyanate (GT) extraction solution. Total RNA and small RNA 
extraction, and mRNA and small RNA northern blots were as described pre- 
viously’’. Nuclear run-on assays were carried out as described”. Real-time PCR 
was carried out as described’. 

DNA cytosine methylation assay. DNA methylation (by Southern blot or bisul- 
phite sequencing) was analysed on leaves of 4-week-old soil-grown plants and 
was carried out as described'"'”*°. For 5-aza-2'-deoxycytidine (Sigma) treat- 
ment, seedlings were grown for 3weeks on MS agar medium containing 
7 ugml~! of the drug and then transferred to Whatman paper containing 1/2 
MS liquid medium and 300 mM NaCl for 5h before luciferase imaging. 
Positional cloning of ROS3. To clone the ROS3 gene, the homozygous ros3 
mutant in the C24 background with RD29A-LUC transgene was crossed to 
wild-type plants of the Columbia ecotype without the RD29A-LUC transgene. 
The F, population was screened for ros3 mutants based on kanamycin sensitivity 
of plants containing the 35S-NPTII transgene and based on leaf morphology. We 
mapped the ros3 mutation to the bottom arm of chromosome V on the basis of 
the 35S-NPTII transgene silencing phenotype. For fine mapping, we used the leaf 
developmental phenotype of ros3. Using SSLP markers, ros3 was first mapped to 
chromosome 5 between markers on MYN8 and MTH12. Through fine mapping 
the ros3 mutation was narrowed down to BAC clone K21L19. To identify the ros3 
mutation, candidate genes from wild-type and ros3 mutant plants were 
sequenced. 

RNA electrophoretic mobility shift assay. ROS3-His fusion protein and dif- 
ferent truncated ROS3-His fusion proteins were cloned into the pET-28a vector 
and expressed in Escherichia coli BL21 cells. The recombinant proteins were 
purified using Ni-NTA Agarose (Qiagen). Protein-small RNA binding assays 
were done according to ref. 31 with slight modifications. Briefly, protein binding 
to RNAs was carried out at room temperature for 30 min in a 30 jl binding mix 
(10mM HEPES, pH7.6, 3mM MgCl, 40mM KCl, 5% glycerol, 2mM DTT, 
100nM [y-*’P]ATP-labelled RNA, and 11g ROS3 protein unless indicated 
otherwise). To test the effect of varying ROS3 protein amount on RNA binding, 
0.25, 0.5, 0.75 and 1 pg ROS3 were used in the assays. To test competition by cold 
RNA, the unlabelled competitor RNA at 100, 300, 500, 2,000 or 10,000 nM was 
mixed with the labelled RNA before addition of ROS3 protein. Binding products 
were resolved on a 3% polyacrylamide native gel. 

Immunolocalization. Leaf nuclei used for immunolabelling were isolated from 
28-day-old plants as described previously’. Following post-fixation in 4% para- 
formaldehyde/PBS and blocking with 2% BSA/PBS, the slides were incubated 
with the following antibodies: rabbit anti-ROS1, rabbit anti-ROS3, mouse anti- 
Flag (Sigma) or mouse anti-c-Myc (Sigma). Anti-ROS1 and anti-ROS3 antibod- 
ies were custom made by Sigma by injecting rabbits with synthetic peptides 
CLHLPEPFQREQGSE and CRSLDENRQRLNGKEN, respectively. Secondary 
antibodies were anti-rabbit or anti-mouse antibodies conjugated to FITC or 
TRITC (Sigma-Aldrich). In ros3 mutant plants expressing epitope tags fused 
to ROS1 and ROS3 recombinant proteins, the transgenes (expressed via their 
respective endogenous promoters) fully rescued the ros3 and ros] mutant 
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phenotypes (data not shown), which indicates that the recombinant proteins 
are functional. 

Transgenic plants. ROS3 genomic DNA (including ~1kb upstream of the 
initiation codon and 300 bp downstream of the stop codon) was amplified from 
BAC clone K21L19 with Platinum pfx DNA polymerase (Invitrogen) with the 
forward primer 2711-gDNA-5F and reverse primer 2711-gDNA-3R. The PCR 
product was cloned between the SacI and HindIII sites of pCAMBIA 1305.1. 
ROS3-flag and ROS1-myc constructs were constructed by inserting ROS3 geno- 
mic DNA with flag tag and ROS1 genomic DNA with myc tag into pCcAMBIA 
1305.1. ROS3 cDNA was amplified by RT-PCR with forward primer 2711- 
cDNA-GFP-5F and reverse primer 2711-cDNA-GFP-3R (Supplementary 
Table 3). The amplified product was cloned into pDONR207 (Invitrogen). 
pEarlyGate104 was used to express YFP—ROS3 fusion protein in Arabidopsis. 
For the ROS3 promoter-GUS construct, the ROS3 promoter was amplified with 
forward primer 2711-pro-GUS-5F and reverse primer 2711-pro-GUS-3R. The 
PCR product was cloned into the PstI and EcoRI sites of pCAMBIA 1391Z. The 
constructs were introduced into Arabidopsis through floral dip transforma- 
tion using Agrobacterium GV3101. Transgenic lines were selected based on 
hygromycin resistance for ros3 complementation, and Basta resistance for 
YFP-ROS3 and ROS3 promoter-GUS expression. GUS staining was as described 
previously’”. 

ROS3 protein pull-down and small RNA cloning. The C-terminal portion of 
ROS3 protein (amino acids 103-749) was cloned into the pET28a vector 
(Novagen), expressed in E. coli, and purified. Anti-ROS3 antisera were produced 
at Cocalico Biologicals Bio by injecting rabbits with the recombinant protein. 
Anti-ROS3 antisera were affinity purified by using the SulfoLink protein kit 
(Pierce, SulfoLink protein kit number 44995). Two-week-old wild-type 
Arabidopsis seedlings were ground in liquid nitrogen and the extract was used 
for immunoprecipitation of ROS3 protein and any RNAs that may be bound to 
ROS3 (ref. 33). The immunoprecipitate was fractionated on a denaturing poly- 
acrylamide gel, and putative small RNAs in the 15—30-nucleotide size range were 
excised from the gel and used for small RNA library construction following 
established protocols**. PCR products were then cloned into the pGEM-T easy 
vector system (Promega, number A1360) and sequenced. 
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Structural insights into amino acid binding and gene 
control by a lysine riboswitch 


Alexander Serganov'™, Lili Huang’* & Dinshaw J. Patel’ 


In bacteria, the intracellular concentration of several amino acids 
is controlled by riboswitches’*. One of the important regulatory 
circuits involves lysine-specific riboswitches, which direct the bio- 
synthesis and transport of lysine and precursors common for 
lysine and other amino acids’ *. To understand the molecular basis 
of amino acid recognition by riboswitches, here we present the 
crystal structure of the 174-nucleotide sensing domain of the 
Thermotoga maritima lysine riboswitch in the lysine-bound (1.9 
angstr6m (A)) and free (3.1 A) states. The riboswitch features an 
unusual and intricate architecture, involving three-helical and 
two-helical bundles connected by a compact five-helical junction 
and stabilized by various long-range tertiary interactions. Lysine 
interacts with the junctional core of the riboswitch and is speci- 
fically recognized through shape-complementarity within the 
elongated binding pocket and through several direct and Kt- 
mediated hydrogen bonds to its charged ends. Our structural 
and biochemical studies indicate preformation of the riboswitch 
scaffold and identify conformational changes associated with the 
formation of a stable lysine-bound state, which prevents alterna- 
tive folding of the riboswitch and facilitates formation of down- 
stream regulatory elements. We have also determined several 
structures of the riboswitch bound to different lysine analogues’, 
including antibiotics, in an effort to understand the ligand- 
binding capabilities of the lysine riboswitch and understand the 
nature of antibiotic resistance. Our results provide insights into a 
mechanism of lysine-riboswitch-dependent gene control at the 
molecular level, thereby contributing to continuing efforts at 
exploration of the pharmaceutical and biotechnological potential 
of riboswitches. 

RNA sensors play a crucial part in many regulatory loops, owing to 
their capacity for directing gene expression in response to various 
stimuli in the absence of protein participation® *. Recent three-dimen- 
sional structures of a thermosensor’, a metallosensor’’, a metabolite- 
bound ribozyme'’” and riboswitches specific for purine nucleo- 
bases'*"*, and for co-enzymes thiamine pyrophosphate'’*'” and 
S-adenosylmethionine’*”” have highlighted how each ribosensor uses 
unique structural features to sense its cognate stimulus. However, the 
molecular details of the organization of amino-acid-specific ribos- 
witches, such as the lysine riboswitch, which efficiently discriminates 
against other free amino acids, their precursors and amino acids 
within a peptide context’*°, remain obscure. The determination of 
the lysine riboswitch structure presents a considerable challenge, 
because of its large metabolite-sensing domain, predicted to form a 
five-way junction’®. 

The T. maritima riboswitch is a typical lysine riboswitch'*? 
(Supplementary Figs 1 and 2a) which uses a transcriptional attenu- 
ation mechanism to repress the production of aspartate-semialdehyde 
dehydrogenase’, which is involved in the synthesis of a precursor for 


methionine, threonine, lysine and diaminopimelate. The structure of 
the lysine-bound riboswitch domain, also known as an ‘L box’, fea- 
tures three-helical and two-helical bundles radiating from a compact 
five-helical junction (Fig. lac and Supplementary Fig. 2b—d) that 
contains lysine inserted into its core. The junction is organized on 
the basis of a modified four-way junction” through colinear stacking 
of helices P1 and P2, and helices P4 and P5, positioned as two inter- 
secting lines of an uneven letter ‘X’. The P2—P2a—L2 stem-loop 
reverses its orientation through two turns important for riboswitch 
function®”’: one of them adjacent to a loop E motif*” and the other 
centred on a turn that replaces the kink-turn motif** found in other 
lysine riboswitches (Supplementary Fig. 3). Stems P2 and P3 are 
aligned by an unusual kissing-loop complex between loops L2 and 
L3 (Fig. 1d and Supplementary Fig. 4), whereas parallel stems P2 and 
P4 are anchored by a conserved loop (L4)-helix (P2) interaction 
(Fig. le). 

The five-helical junction contains three layers of nucleotides, each 
composed of two interacting base pairs, organized around the cent- 
rally positioned lysine which fits into a tight pocket and is specifically 
recognized by its charged ends (Fig. 2a—c). This lysine-bound pocket 
architecture (Fig. 2) is also retained in lysine analogue complexes 
(Fig. 3) outlined later. The top layer, composed of G14-C78 and 
G115-C139 base pairs (Fig. 2d), is stabilized through ribose zipper 
and type I A-minor triple interactions with invariant A81, which in 
turn is paired with Na” -bound G80 (Fig. 2d and Supplementary Fig. 
5a). The middle layer, containing G12-C79 and G114*U140 base 
pairs, forms specific interactions with the bound lysine (Fig. 2e). 
Both the carboxylate and ammonium groups of the lysine ‘main 
chain’ segment are hydrogen-bonded to the minor groove edges of 
purine bases and sugar 2'-OH groups (Fig. 2b, e), whereas the 
€-ammonium protons of the side chain form hydrogen bonds with 
a non-bridging phosphate oxygen, a sugar ring oxygen and a tightly 
bound water molecule, W1 (Fig. 2c). In addition, lysine is sand- 
wiched between the A81 base (Fig. 2d) and the G11*G163 base pair 
from the bottom junctional layer (Fig. 2e and Supplementary Fig. 
5b), thereby stabilizing the top of the P1 helix which is necessary for 
gene expression control. The bound lysine facilitates the holding 
together of the stacked P1—P2 and P4—P5 helical segments, and con- 
tributes to the positioning of the P3 helix by locking up G80, which is 
placed over the e-ammonium group of lysine, and stacks with the 
A82-U113 base pair of P3 (Fig. 2a). This stable junctional conforma- 
tion is further reinforced by a tertiary stacking interaction between 
G110 and A164 (Fig. 2a) characteristic for riboswitches from ther- 
mophiles, as well as other interhelical contacts (Figs 1b, c and 2a). 

A notable feature of the lysine-binding pocket is a K* cation 
(Fig. 2b), which binds a carboxyl oxygen of lysine and zippers up 
the junction using several coordination bonds. The K™ cation is 
directly observed on the anomalous map (Supplementary Fig. 6) 
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and is replaceable by its mimics, Cs* and Tl*, but not by Mn** 
(Supplementary Figs 7-9), a mimic of Mg** (refs 25 and 26). The 
importance of K* for lysine binding has been demonstrated in pri- 
mer extension experiments. In the presence of lysine and at physio- 
logical concentration of K*, reverse transcriptase pauses before 
the junction at A169, reflecting the formation of a stable junctional 


C82 P2a-L2 turn 
a GU d 
60 P2a J! . 
GCAGGGC 
ollil i oyu A 
UGUCCCA 40 Ao A-50 


G-c 39 95/40 —-G 
Cc-G 176-4 L2 
Pena a U0 U-A--: Kissing A42 
Loop E o loops G95 


27-Go Uo A-65 
GQU A 


93-) © G-101 


“A 0G G-C 
C— A-U 
aa 90-G—C Gas 
C-G Ay G44 
A-u  c-@ PS G101 
U—A-120 aoe 
C-G 135-A —U -_ 
P2 C—G-% C-G Go U 6 io 
G—C Cc-G aoe 
C-G AoG ee 
GHC Cx=G AU: 
C4 Gyet14 A : 
IGI-ICE “,(G]-80 
ca 
Go |G A 
10-[A]o G- GI 
G-C G- 
G-C U 
C-G G 
A-U E 
G-C- 
Piece U 
C-G Cc 
Gou U 
+G—C 
y ¥ 


Loop- 
helix 


Figure 1| Overall structure and long-range tertiary interactions of the 
lysine-bound T. maritima riboswitch. a, Schematic of the riboswitch fold 
observed in the crystal structure of the complex. The bound lysine is in red. 
The RNA domains are depicted in colours used for subsequent figures. Base- 
specific tertiary contacts and long-range stacking interactions are shown as 
thin green and thick blue dashed lines, respectively. Nucleotides invariant in 
known lysine riboswitches are boxed. b, ¢, Overall lysine riboswitch 
structure in a ribbon representation showing front (b) and rotated by ~60° 
(c) views. d, The L2-L3 kissing loop interaction is formed by six base pairs, 
supplemented by interstrand stacking interactions between A42 and C95, 
G43 and U94, and G44 and G101. Hydrogen bonds between interstrand base 
pairs and orthogonally aligned G43 and U94 bases are depicted by dashed 
lines. e, The L4-loop—P2-helix interaction formed by an insertion of the 
A126—A127-A129 stack of L4 into the RNA groove of P2 distorted by non- 
canonical base pairs. 
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conformation (Fig. 4a, lanes 8, 10). The pause decreases 33-fold after 
the replacement of K* by Na* (Fig. 4a, lanes 2, 4), suggesting that 
there is either reduced stability or failure to generate a junction 
competent for lysine recognition under K*-free conditions. These 
results have been supported by equilibrium dialysis experiments with 
T. maritima and Bacillus subtilis riboswitches. In both cases, lysine 
binding affinity significantly decreased when K* was omitted or 
replaced by Na* (Fig. 4b). Note that lysine binds better to a ribos- 
witch from a thermophile than from a mesophile. 

The preference of K* for binding to the negatively charged car- 
boxylate group contrasts with Mg**-mediated phosphate recog- 
nition in other ribosensors’’'*’*'® and might also be a 
characteristic for other amino-acid-specific riboswitches. Because 
more than 20 lysine-binding proteins (listed in the Protein Data 
Bank) do not use cations to mediate lysine recognition, this feature 
is probably unique to RNA, given that it lacks the positively-charged 
side chains found in proteins. 

Folding of most RNAs, including the B. subtilis lysine riboswitch”’, 
requires Mg~* . However, the crystals of the T. maritima lysine ribos- 
witch can be grown in the absence of Mg”* (Supplementary Fig. 10). 
Moreover, a Mn?* soak does not replace cations in the structure 
grown with Mg**, and, on the basis of the coordination distances, 
most of these cations can be assigned as Na”. These results indicate 
that the L box from a thermophile does not critically depend on 
Mg”, the function of which can be co-opted by monovalent cations. 

The identification of antibiotic-resistance mutations”””* in the 
lysine riboswitch'”, together with the demonstration ofa direct inter- 
action between the riboswitch and lysine-like antibacterial com- 
pounds’*”, suggest that riboswitch targeting, along with other 
processes”’, is an important component of the antibiotic activity. 
To understand the molecular basis of antibiotic resistance and 
explore the pharmaceutical potential of the lysine riboswitch, we 
have determined structures of the riboswitch bound to antibacterial 
compounds S-(2-aminoethyl)-L-cysteine (AEC) and 1-4-oxalysine’, 
which contain sulphur and oxygen at position C4, respectively 
(Fig. 3a). Because the pocket has a small cavity between the C4 and 
N7 positions of bound lysine (Fig. 3c and Supplementary Fig. 11a), 
both C4-substited analogues can be placed within the pocket in a 
manner similar to bound lysine (Fig. 3b, top panel), suggesting the 
potential for incorporation of even larger C4-substituents. Despite 
similar placement within the pocket, primer extension assays suggest 
that there is weaker binding of AEC and L-4-oxalysine to the ribos- 
witch (Fig. 4c and Supplementary Fig. 12), possibly due to an 
increased electronegativity of substituents at the C4 position. 

Next, we determined the structures with lysine analogues 
L-homoarginine and N6-1-iminoethyl-L-lysine, where the ¢- 
ammonium group of lysine is replaced by a guanidinium group 
and its methyl-substituted variant, respectively (Fig. 3a)°. In these 
structures (Fig. 3b), the side chains of lysine analogues are slightly 
shifted to provide better stacking interactions with the G80 base. The 
nitrogen atoms of the guanidinium-like extensions replace water 
molecules W1 and W2, found in the lysine complex (Fig. 2b). The 
G163 sugar is slightly rotated, so that the hydrogen bond pattern of 
WL is retained by both ligands, whereas L-homoarginine forms extra 
hydrogen bonds with G163. Although most RNA-ligand contacts are 
preserved, both analogues demonstrate weaker interactions than 
lysine in primer extension (Fig. 4c and Supplementary Fig. 12) and 
in-line probing? experiments, emphasizing the fine structural com- 
plementarity between the e-ammonium group of lysine and RNA. 

The lysine-binding pocket has two openings, which could be 
exploited for the design of next generation lysine-like analogues 
(Fig. 3c, d and Supplementary Fig 11b, c). One of the openings could 
accommodate modifications or extensions of the carboxylate group, 
possibly by substituting the K* cation. The other smaller opening 
could allow extensions from N9 of L-homoarginine and iminoethyl- 
L-lysine. The analogue-bound structures and the biochemical data° 
indicate that the lysine-binding pocket is rather rigid, and only 
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Figure 2 | Structure and interactions in the junctional region of the lysine 
riboswitch. a, Stereo view of the junction with bound lysine. Green sphere 
depicts a K™ cation. b, Details of riboswitch lysine interactions. Lysine is 

positioned within the omit F, — F, electron density map contoured at 3.50 


accommodates compounds which can sterically fit the pocket. 
Therefore, lysines in a polypeptide chain and branched amino acids 
are not recognized by the riboswitch. The lysine riboswitch also 
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level. Water molecules are shown as light blue spheres. K* cation 
coordination and hydrogen bonds are depicted by dashed lines. ¢, Direct and 
water-mediated interactions involving c-ammonium group of lysine. 

d, e, Interactions in the top (d) and middle (e) junctional layers. 


discriminates against smaller amino acids that fit into the pocket 
(data not shown) but are unable to make essential intermolecular 
contacts in the vicinity of G80. 

To gain insights into lysine-induced conformational rearrange- 
ments of the riboswitch in solution, we performed footprinting 
experiments using in-line probing, specific for flexible RNA regions, 
and cleavage by the nucleases T2 (single-stranded RNA) and V1 
(paired and stacked regions). As in B. subtilis'’, the transition from 
the free to the lysine-bound states of the T. maritima riboswitch is 
accompanied by conformational changes within the junctional core, 
detected as strong cleavage reductions in both in-line and V1 probing 
(Fig. 4d-f and Supplementary Fig. 13). However, the kissing loop 
complex is probably preformed in the free riboswitch>”’, as evident 
from the absence of cleavages in in-line’* (Fig. 4d) and T2 probing 
(Supplementary Fig. 13b), coupled with V1 cleavages at nucleotides 
45-46 (Fig. 4e), which are only weakly enhanced by lysine binding. 
The P2-L4 tertiary contact seems to be more dynamic in character as 
reflected in the rather strong T2 (nucleotides 126-127, Fig. 4f) clea- 
vage patterns in the lysine-free form, coupled with only weak pro- 
tection on complex formation. The projection of the in-line cleavage 
reductions (this study and ref. 1) on the structure (Fig. 4g) provides 
the first glimpses into the primary determinants of lysine recognition 
and how the riboswitch junction folds on ligand binding. Assuming 
that helix P1 is not fully formed before lysine binding and that pro- 
tections in P5 are, at least in part, due to P1—P5 interactions, we 
propose that the main structural changes in solution, seen in both 
studies, involve stabilization of G80 and formation of the G12-C79 


Figure 3 | Interactions of lysine analogues with the riboswitch. a, Chemical 
structures of lysine and its analogues. b, Conformation and interactions of 
lysine analogues with the riboswitch. Lysine (red) and analogues (cyan) are 
superposed. Interactions between riboswitch and lysine analogues should be 
compared with lysine recognition in Fig. 2b. c, Cross-section through the 
surface view of the lysine-binding pocket showing the opening next to the 
carboxyl group of lysine (red arrow) and the free space next to the C4 atom 
(blue arrow). d, Cross-section through the homoarginine—riboswitch 
complex showing the opening (red arrow) next to the guanidinium group. 
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and G11°G163 junctional base pairs, followed by stabilization of the 
surrounding regions and the P1 helix (Supplementary Fig. 14). 
Prompted by pre-formation of tertiary riboswitch elements in 
solution, we have crystallized the riboswitch in the absence of lysine. 
This 3.1 A structure (Supplementary Fig. 15) is very similar to the 
lysine-bound form, except that it lacks lysine and junctional K*. 
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Figure 4 | Probing lysine riboswitch tertiary structure. a, Primer extension 
analysis at various K* concentrations. **P-labelled oligonucleotide was 
annealed to the 3’ end of 265-nucleotide T. maritima RNA (1 UM) and 
extended by reverse transcriptase in the presence of 2mM MgCh, at 
indicated concentrations of monovalent cations (in mM), and with or 
without a tenfold excess of lysine over RNA. b, Lysine binding affinity 
measured by equilibrium dialysis for riboswitches from T. maritima (top) 
and B. subtilis (bottom). Mg*', K* and Na’ concentrations are 20, 100 and 
100 mM, respectively. The dissociation constants (mean + s.d., UM; 

n= 2-4) are: T. maritima, 0.10 + 0.03 (Mg”* + K*), 4.14 + 0.67 

(Mg”* + Na‘), 15.93 + 0.09 (Mg”*); B. subtilis, 2.95 + 0.30 (Mg** + K*). 
c, The apparent ligand concentration at which reverse transcriptase pausing 
is half-maximally attained (P59; n = 2) in the primer extension experiments. 
HArg, L-homoarginine; IEL, iminoethyl-.-lysine; Lys, L-lysine; Oxa, L-4- 
oxalysine. d, In-line probing of 5’ *’P-labelled T. maritima 174-nucleotide 
RNA (1.6 nM) in the absence and presence of lysine (1.1 uM). Tl and OH 
designate RNase T1 and alkaline ladders, respectively. NR, no reaction. 
Strong and weak cleavage reductions are shown in red and pink colours, 
respectively. e, f, Probing of 174-nucleotide RNA (1 uM) by V1 and T2 
nucleases with or without a tenfold excess of lysine. Weak and strong 
cleavage enhancements are shown in light and dark green, respectively, in 
e. g, Strong lysine-induced cleavage reductions are colour coded in the 
riboswitch structure. Light orange, green and red are reductions identified in 
ref. 1 using B. subtilis RNA with a short P1 helix, overlapping reductions of 
the present study and in ref. 1, and extra reductions found in the present 
study, respectively. 
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Although the absence of the expression platform and long P1 helix 
facilitate formation of this conformation, the structure emphasizes 
the importance of RNA interactions in maintaining the riboswitch 
conformation, suggests a crucial role of K* in mediation of lysine- 
RNA but not RNA-RNA interactions, and reinforces the feasibility of 
lysine stabilizing a largely preformed riboswitch structure. 

The L box structure readily explains mutations that deregulate gene 
expression and confer resistance to AEC”””* (Supplementary Fig. 16). 
The G12A, G12C and G81A mutations disrupt the lysine-binding 
pocket, whereas the G11A, G11U, G9C and C166U substitutions pre- 
vent pairing of the P| helix. Therefore, intracellular lysine and AEC 
cannot bind the mutated riboswitches, and the segment downstream 
of Gl61 engages in formation of an anti-terminator stem 
(Supplementary Fig. 1), resulting in constitutive lysine production. 

The unusual architecture and high ligand specificity, achieved 
through a combination of shape complementarity and K* -assisted 
recognition of the bound lysine, distinguishes the lysine riboswitch 
from other riboswitches. Given the importance of lysine riboswitch- 
controlled gene expression for bacterial viability and the absence of 
the diaminopimelate pathway in mammals, the structure provides 
critical details towards facilitating the design of lysine-like analogues 
targeting riboswitches and other cellular sites. 


METHODS SUMMARY 

Crystallization and structure determination. A 0.4 mM lysine riboswitch com- 
plex was prepared by mixing in vitro transcribed RNA and lysine in a buffer 
containing 100 mM potassium acetate, pH 6.8, and 4mM MgCl,. Crystals were 
grown by hanging-drop vapour diffusion after mixing the complex and the 
reservoir (18% (w/v) PEG4000, 100 mM sodium citrate, pH 5.7, and 20% iso- 
propanol (v/v)) solutions at 1:1 ratio. For soaking, crystals were placed in the 
reservoir solution with PEG4000 replaced by 20% PEG400, and then incubated 
in the presence of either 3mM [Ir(NH3)¢]°* or 10-50mM of Cs*, TI* and 
Mn’* salts for 7h. Crystals were flash frozen in liquid nitrogen and data were 
collected at 100K. The structure was determined using 24A multiwavelength 
anomalous dispersion iridium data and SHARP*. The RNA model was built 
using TURBO-FRODO (http://www.afmb.univ-mrs.fr/-TURBO-), and refined 
using the 1.9A native data set to Ryork/ Riree 19.2/22.9 (Supplementary Table 1 
and Supplementary Fig. 17). Lysine and cations were added to the model on the 
basis of analysis of 2F,— F., F,— F. and anomalous electron density maps. 
Cations were modelled on the basis of the number of coordination bonds, their 
distances, coordination geometry and temperature factors. Analogue-bound 
and metal-soaked riboswitch structures were refined using the native riboswitch 
model (Supplementary Tables 2 and 3). Figures were prepared with PyMol 
(http://www.pymol.org). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

RNA preparation and complex formation. The lysine riboswitch, followed by 
the hammerhead ribozyme, was transcribed in vitro using T7 RNA polymerase. 
RNA was purified by denaturing polyacrylamide gel electrophoresis (PAGE) and 
anion-exchange chromatography. Lysine analogues were added to RNA at a 2.5— 
2.75 to 1 molar ratio. To form a complex without Mg”*, the RNA was mixed with 
100mM potassium-acetate, 1.0mM EDTA and lysine. To prepare RNA for 
crystallization without lysine, 0.2 mM RNA was supplemented with 50 mM pot- 
assium acetate, pH 6.8, 50 mM sodium acetate, pH 6.9, and 2mM MgCh, and 
concentrated twofold by Speedvac. Before crystallization, sodium-citrate, 
pH5.7, was added to the mixture up to 100 mM, and the RNA sample was heated 
at 55 °C for 5 min and cooled on ice for 15 min. 

Crystallization. Hanging drops were prepared by mixing 1 il of the complex 
with 1 pl of the reservoir solution. The drops were equilibrated against 1 ml of 
reservoir solution at 20°C for ~ 1-2 weeks. The riboswitch in the free state was 
crystallized in the solution containing 21% (w/v) PEG4000, 100 mM Bis-Tris, 
pH5.5, and 25% isopropanol (v/v). For cryoprotection, crystals were quickly 
passed through the stabilizing solution, which was the reservoir solution with 
PEG4000 replaced by 20% PEG400. For soaking, crystals were passed through 
several 5 pl drops of the stabilizing solution, and then incubated in 5 ul of sta- 
bilizing solution supplemented with 3 mM [Ir(NH3).¢]Cl;, 10 mM CsCl, 10 mM 
thalium acetate, or 50 mM MnCl) salts for 7—8 h. 

X-ray crystallography. Data were reduced using HKL2000 (HKL Research). The 
structure was determined using the autoSHARP option of SHARP and 2.4A 
MAD iridium data. The resulting experimental map was of excellent quality 
(Supplementary Fig. 17a) for most of the RNA molecule. The structure contains 
1 RNA and 16 iridium hexamine sites (2 of them are split) per asymmetric unit 
(Supplementary Fig. 17b). The RNA model was built using 2.4 A MAD electron 
density map, and then refined with REFMAC”! using 1.9A native data. The 
bound lysine and several cations with octahedral coordination geometry were 
added on the basis of the 2F, — F. and F, — F. electron density maps. Cations 
were interpreted as Na’ or Mg’* on the basis of the coordination distances in the 
range of 2.25-2.85A (Supplementary Fig. 17d) and 2.0-2.3A, respectively. 
Water molecules were added using ARP/wARP*. K* cations were added on 
the basis of the 1.9A 2F,—F. and 2.94 anomalous (Supplementary Fig. 6) 
electron density maps and typical K* coordination distances (Supplementary 
Fig. 17c). The final 1.9A riboswitch model contains 174 nucleotides, 1 lysine 
molecule, 1 magnesium cation, 3 potassium and 29 sodium cations. Cst. TL: 
and Mn?" cations were modelled on the basis of the anomalous maps 
(Supplementary Figs 7-9), whereas addition of other cations was guided by 
the high-resolution structure and the analysis of coordination geometries and 
distances. 

Primer extension assay. Primer extension experiments were performed using 
265-nucleotide full-length riboswitch. The *’P-labelled 13-mer DNA oligonu- 
cleotide (100,000 c.p.m.), complementary to nucleotides 253-265 of RNA, was 
annealed to the 3’ end of RNA (final RNA concentration 1 11M in the assay). 


nature 


Primer extension was conducted in 15 pl volume with 40 U of moloney murine 
leukaemia virus reverse transcriptase in 50mM Na-HEPES, pH7.9, 2mM 
MgCl,, and variable concentrations of NaCl and KCl (Fig. 4), with or without 
a tenfold excess of lysine over RNA. After 30 min incubation at 37 °C, the reac- 
tions were precipitated with ethanol, dissolved in loading buffer and analysed by 
10% PAGE. Efficiency of lysine-induced pausing of reverse transcriptase at nuc- 
leotide A169 was quantified using FLA-7000 PhosphorImager and Image Gauge 
software (Fujifilm). Band intensities from independent experiments were aver- 
aged after gel-loading correction, background subtraction and normalization. 

The primer extension assay in the presence of lysine analogues was performed 
in 50 mM Na-HEPES, pH 7.9, 2mM MgCl, 50 mM KCl, and lysine analogues in 
the range 10 8-10 *M. The data were fitted using a bimolecular equilibrium 
equation (Supplementary Fig. 12) and the resulting P59 values are reported in 
Fig. 4c. Reverse transcriptase sequencing reactions were run in parallel. 
Footprinting studies. For footprinting experiments, the 174-nucleotide meta- 
bolite-sensing domain of the riboswitch was radioactively labelled at the 5’ end 
by the kinase reaction. For in-line probing, 30,000-300,000c.p.m. of 174- 
nucleotide RNA (1.6—16nM) was incubated in 10-30 pl solution containing 
50mM Tris-HCl, pH 8.3, 100mM KCI and 20mM MgCl, in the absence or 
presence of ~6—600-fold excess of lysine (Fig. 4) at room temperature for 
~40h. After incubation, aliquots were analysed by PAGE along with alkaline 
ladder and T1 nuclease digestion. 

For nuclease footprinting experiments, 20 tl samples of 174-nucleotide RNA 

(100,000 c.p.m.) with a final RNA concentration 1 11M were preheated at 37 °C 
for 10min in 50mM Na-HEPES, pH7.9, 50mM KCl and 2mM MgCh. 
Mixtures were incubated with a tenfold excess of lysine over RNA at 37 °C for 
15 min. Cleavage reactions were performed with 0.0025 U RNase V1 (Pierce) or 
0.2 U RNase T2 (Sigma) at 37°C for 10 min. Reactions were quenched by the 
addition of 80 l cold buffer and were immediately extracted with phenol- 
chloroform and precipitated with ethanol. Radiolabelled RNA products were 
dissolved and analysed by PAGE. 
Equilibrium dialysis. The assay was performed as described in ref. 1 using 5 kDa 
DispoEquilibrium DIALYZERS (Harvard Apparatus). In brief, 30 ul RNA (from 
0.001 to 20 1M) in the in-line probing buffer was placed in chamber A of the 
dialyser and equilibrated for 16h at room temperature against chamber B con- 
taining 30 ull °H-labelled lysine (1 nM; ~6,000 c.p.m.) in the same buffer. The 
amount of bound lysine was calculated by subtracting the radioactivity counts of 
chamber B from chamber A. The data were fitted using a bimolecular equilib- 
rium equation, assuming that the free lysine concentration is negligible. 


31. Murshudov, G. N., Vagin, A. A. & Dodson, E. J. Refinement of macromolecular 
structures by the maximum-likelihood method. Acta Crystallogr. D 53, 240-255 
(1997). 

32. Perrakis, A., Morris, R. & Lamzin, V. S. Automated protein model building 
combined with iterative structure refinement. Nature Struct. Biol. 6, 458-463 
(1999). 
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Concurrent nucleation of 16S folding and induced fit in 


30S ribosome assembly 


Tadepalli Adilakshmi't, Deepti L. Bellur*+ & Sarah A. Woodson’ 


Rapidly growing cells produce thousands of new ribosomes each 
minute, in a tightly regulated process that is essential to cell 
growth’’. How the Escherichia coli 16S ribosomal RNA and the 
20 proteins that make up the 30S ribosomal subunit can assemble 
correctly in a few minutes remains a challenging problem, partly 
because of the lack of real-time data on the earliest stages of assem- 
bly. By providing snapshots of individual RNA and protein inter- 
actions as they emerge in real time, here we show that 30S assembly 
nucleates concurrently from different points along the rRNA. 
Time-resolved hydroxyl radical footprinting’ was used to map 
changes in the structure of the rRNA within 20 milliseconds after 
the addition of total 30S proteins. Helical junctions in each 
domain fold within 100 ms. In contrast, interactions surrounding 
the decoding site and between the 5’, the central and the 3’ 
domains require 2-200 seconds to form. Unexpectedly, nucleo- 
tides contacted by the same protein are protected at different rates, 
indicating that initial RNA-protein encounter complexes refold 
during assembly. Although early steps in assembly are linked to 
intrinsically stable rRNA structure, later steps correspond to 
regions of induced fit between the proteins and the rRNA. 

It has been previously shown that hierarchical addition of ribo- 
somal proteins to the 16S rRNA produces cooperativity* that is due to 
protein induced structural changes in the 16S rRNA, rather than to 
direct contacts between proteins’. Because the rRNA becomes more 
structured as proteins join the complex, assembly is coupled to the 
folding pathway of the rRNA®’. The simplest kinetic model for 30S 
assembly is that regions of the 16S rRNA contacted by primary 
assembly proteins fold first, whereas helices stabilized by tertiary 
assembly proteins fold last. If assembly is strictly sequential, each 
subdomain of the rRNA will fold within a distinct time, producing 
a limited set of intermediate complexes. 

Alternatively, the rate of protein binding may initially depend on the 
stochastic probability of forming locally stable rRNA and protein inter- 
actions, with progression to later intermediates depending on propaga- 
tion of this conformational order to neighbouring regions in the rRNA. 
If more than one region of the naked rRNA can fold, assembly is 
expected to nucleate from many places at once, producing an ensemble 
of reconstitution intermediates and multi-stage assembly kinetics’. 

To visualize the intermediates of 30S ribosome assembly, the struc- 
ture of the 16S rRNA was probed by time-resolved X-ray hydroxyl 
radical footprinting (Fig. 1). The extent of RNA cleavage correlates 
with backbone exposure’. Thus, this method probes individual ter- 
tiary contacts in the rRNA as well as protein interactions that bury the 
rRNA backbone. Previous efforts to map the conformational changes 
in the 16S rRNA during 30S ribosome assembly used low temper- 
ature or subsets of proteins to stall assembly at specific stages'*''. We 
took advantage of the exceptional time resolution of synchrotron 


X-ray footprinting (~10 ms) to resolve the very early stages of assem- 
bly in real time, without needing to stall the reaction. 

At the start of the experiment, native 16S rRNA from E. coli was 
prefolded at 42°C in standard reconstitution buffer (see Methods; 
Fig. 1a) because this pretreatment improved the quality of recon- 
stitution (Supplementary Fig. 2). Assembly of 30S ribosomal sub- 
units was initiated at 30 °C by mixing the rRNA with native total 30S 
proteins (TP30) using a rapid quench apparatus (see Supplementary 
Methods). The rRNA-protein mixture was exposed to a white syn- 
chrotron X-ray beam for 10 ms to generate hydroxyl radicals and to 
cleave the exposed parts of the rRNA backbone’. The extent of cleav- 
age at each position in the 16S rRNA was analysed by primer exten- 
sion and compared to parallel reactions on naked 16S rRNA and 
native 30S ribosomes (Fig. 1b). 

Many regions of the prefolded 16S rRNA were strongly protected 
from cleavage, indicating that the rRNA has native tertiary structure 
in the absence of proteins. This is consistent with chemical footprint- 
ing experiments”'®’” and with neutron scattering data showing that 
the deproteinized 16S rRNA has roughly the same dimensions as it 
does in the 30S ribosome”. 

After a 3 min incubation with 30S proteins, the cleavage pattern 
was similar to that of native 30S ribosomes (Fig. 1b), and correlated 
well with the solvent accessibility of the C4’ atoms predicted from 
crystal structures of the Thermus thermophilus 30S (ref. 14) or E. coli 
70S ribosomes’*. Reconstitution of 30S subunits was confirmed by 
sedimentation velocity and activity assays (Supplementary Fig. 2). 
Further rearrangements after 3 min may be needed to produce fully 
active subunits, as the temperature-dependent transition from the RI 
reconstitution intermediate to the RI* activated reconstitution inter- 
mediate late in assembly is slow at 30 °C (ref. 16), and is associated 
with changes in the accessibility of the nucleotide bases rather than 
the backbone'’. Nonetheless, most of the expected RNA and RNA- 
protein contacts formed within 3 min. 

The kinetics of rRNA backbone protection showed the presence of 
many early assembly intermediates. Some nucleotides in the 16S 
rRNA were completely protected from hydroxyl radical cleavage 
within the 20ms dead time of our experiment, whereas others 
required 1-3 min to be fully protected (Fig. 1c and Supplementary 
Table 1). For many nucleotides, a partial burst of protection in the 
first 50-100 ms was followed by slower saturation of the contact over 
the next few minutes (Fig. 1c and Supplementary Fig. 3). 

The multiphasic folding kinetics strongly indicate that individual 
30S complexes take different routes to the final structure, as sug- 
gested by previously reported rates of protein binding” and by the 
kinetic partitioning of simple ribozymes among parallel folding path- 
ways'*. In kinetic footprinting experiments on the Tetrahymena ribo- 
zyme, partial protection of the RNA backbone at intermediate times 


'T.C. Jenkins Department of Biophysics, and *Program in Cell, Molecular and Developmental Biology and Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, 
Maryland 21218-2685, USA. +Present address: Weis Center for Research, Geisinger Medical Center, 100 North Academy Avenue, Danville, Pennsylvania 17822, USA (T.A.); 
Department of Molecular Genetics and Cell Biology, The University of Chicago, 920 East 58th Street, Chicago, Illinois 60637, USA (D.L.B). 
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Figure 1| Time-resolved X-ray footprinting of 30S ribosome assembly. 

a, Native 16S rRNA and total protein from 30S subunits (TP30) were mixed 
within 5-10 ms and irradiated with a synchrotron X-ray beam to cleave the 
RNA at exposed riboses. b, 16S fragments were analysed by primer 
extension. The cleavage pattern shown was from 0.02 to 180s after TP30 
addition. 30S, triplicate controls on native 30S subunits; RNA, prefolded 16S 
rRNA; UCAG, dideoxy sequence ladders; —, untreated RNA. The primer 
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annealed after nucleotide 1,257. c, The relative saturation (Y) of each 
protection versus assembly time, fitted to single or double exponential rate 
equations (Supplementary Methods). Filled circles, nucleotide (nt) 398; 
filled squares, nucleotides 617-618; filled triangles, nucleotide 263; open 
circle and plus symbol, the average of 30S controls. Further data are shown in 
Supplementary Fig. 3 and Supplementary Table 1. 
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Figure 2 | Simultaneous folding of 16S domains. a, Protected nucleotides 
(740 positions) were clustered according to the rate constant for backbone 
protection and coloured as indicated in the key. Where the amplitude of the 
initial burst phase is =60%, the slower rate constant is used. Grey (120 
positions) indicates the rate constant was undetermined (ND) owing to a 
pause in reverse transcription or to weak protection. Natural adenine 


methylation hampered quantitative analysis of residues in the decoding site. 
Circles, RNA—RNA contact; squares, RNA_ protein contact; open circles, 
predicted C4’ accessible surface area > 4 A?; solid symbols, buried C4’ in 
crystal structures. b, Three-dimensional ribbon of E. coli 16S rRNA (PDB 
accession 2avy)'° coloured as in a, viewed from the 50S interface. See 
Supplementary Fig. 4 for further views of each domain. 
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was explained by the contemporaneous formation of folding inter- 
mediates with different structures'’. We observed no exposure of 16S 
residues that might indicate the disappearance of non-native assem- 
bly intermediates over time. However, such interactions may have 
been too dynamic or heterogeneous to produce a distinct footprint. 

To visualize the assembly pathway of the rRNA, the observed rate 
constants for making individual backbone contacts were clustered, 
and the clusters were then projected onto the secondary and tertiary 
structure of the mature 16S rRNA (Fig. 2). This locates the interactions 
formed at each stage, even though the intermediate structures may 
differ from the mature structure. Nucleotides with similar rates of 
backbone protection did not map to single domains, but were distri- 
buted throughout the 16S rRNA. In general, nucleotides that were 
protected in the first 20-50 ms (red; Fig. 2) were partially folded in 
the naked rRNA or were protected by a local structure such as a kink or 
a helical junction. Helices contacted by the primary assembly proteins 
S4(5’ domain), $7 (3' domain), $8 and S15 (central domain) were also 
protected within 50 to 100 ms after the proteins and rRNA were mixed. 
Thus, each domain assembles independently and simultaneously. 

In contrast, the messenger RNA decoding site and long-range 
interactions between domains required the longest time to form 
(blue; Fig. 2). For example, 16S helix H21 extends from the central 
domain to wrap around the body of the 30S subunit, where it con- 
tacts proteins $4 and S16. Interactions with H21 required 1 min to 
saturate completely (0.02-0.1s—'). The slow appearance of rRNA 
backbone contacts at G530 (H18), and the central pseudoknot 
(H2) in the decoding site, is consistent with previous equilibrium 
studies demonstrating that structural changes in the central pseudo- 
knot are linked to the transition from the RI to RI* intermediates late 
in 30S assembly”’. 

The interplay between rRNA interactions that are intrinsically 
stable and those that are protein-dependent is illustrated by the kin- 
etics of RNA and RNA-protein interactions in the body of the 30S 
ribosome (Fig. 3a). H44 (3’ minor domain) lies along one face of the 
body, packing against the stable tertiary structure formed by H7—H10 
and H14 (ref. 20). Interactions with the distal tip of H44 are mediated 
by protein S20, which inserts its carboxy-terminal «-helix behind H8 
and H14 in the 5’ domain*’. Many RNA tertiary interactions in the 
body of the 30S ribosome form within 20 ms (red; Fig. 3a), whereas 
interactions with $20 form in 0.5 to 3s (green), and long-range con- 
tacts with H44 form in ~10s (blue; Fig. 3a). The multi-stage assem- 
bly of these RNA and protein interactions is consistent with the 
results of hydroxyl radical probing from Fe-EDTA complexes teth- 
ered to S20 (ref. 22). 

We next addressed whether the rates of RNA-protein interactions 
correlated with the position of each protein in the assembly map. 
Protections arising from direct protein—rRNA contacts were iden- 
tified from crystallographic structures of the 30S ribosome’! and 
previous footprinting of individual proteins” (see Supplementary 
Methods). Four of the six primary assembly proteins (S4, S7, S8 and 
S15) protected a segment of their binding site during the first 50 ms 
of assembly. This was only true of three of the nine secon- 
dary assembly proteins (S16, S9 and S10) and none of the tertiary 
assembly proteins (Supplementary Table 2). Thus, primary assembly 
proteins more frequently bind to the rRNA early in assembly, in 
agreement with pulse-chase measurements of protein binding 
kinetics”. 

Unexpectedly, nucleotides contacted by a single protein were pro- 
tected from hydroxyl radical cleavage with different rate constants 
(Fig. 4). For example, protein S4 binds a five helical junction in the 5’ 
domain, and initiates 30S assembly together with protein S7 (ref. 24). 
The interactions between S4 and 16S H17 (nucleotides 436-441) 
saturated in 20—50 ms, whereas those in H3, H16 and H18 formed 
in 0.3—2 s, and those with H21 (nucleotide 620) appeared over 8-10 s 
(Fig. 4a). S7 binds a complex helical junction in the 16S 3’ domain 
(Fig. 4b)?!. H43 (nucleotides 1,369-1,377) was protected in 20- 
50 ms, whereas contacts with H29, H41 and the distant H37 were 
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protected more slowly. Thus, S4 and S7 engage their binding sites in 
stages rather than in one step. Similar behaviour was observed for 
proteins S8, S9, S10, S15 and S16 (Supplementary Figs 4 and 5). 

The unexpectedly complex kinetics of the rRNA-protein interac- 
tions can be explained by the formation of encounter complexes that 
slowly reorganize into the final complex. For S4, the variable kinetics 
of rRNA backbone protection may be due to co-folding of the protein 
and the rRNA”. H17, which is protected rapidly, contacts the well- 
folded C-terminal domain of $4 (ref. 21). In contrast, H18 and the 
top of H16, which are protected at an intermediate rate, contact the 
$4 amino-terminal domain, which is disordered in solution”®. 

If different regions of the 16S rRNA fold and interact with the 
ribosomal proteins simultaneously, the implications for the hier- 
archy of protein association represented by the assembly map remain 
a challenge. The association rates suggested by the fastest rates of 
RNA backbone protection approach the diffusion-controlled limit 
(109M 's_'). However, the slowest protection rates are most 
similar to the protein binding rates measured by pulse-chase mass 


Figure 3 | Stepwise assembly of RNA and protein interactions. a, Protein 
S20 (yellow ribbon) contacts the 30S body in the 5’ domain (grey) earlier 
than helix H44 in the 3’ minor domain (pink). 16S nucleotides are coloured 
as in Fig. 2. b, Proteins S7 (yellow) and S9 (green) protect a segment of their 
binding site immediately (red), whereas nucleotides at the interface between 
the subdomains are protected slowly (blue). 
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Figure 4 | Ribosomal proteins interact with the rRNA in stages. 

a, b, Kinetics of direct rRNA backbone protection by proteins $4 (magenta) 
and S7 (yellow), coloured as in Fig. 2. Schematic symbols are also as in Fig. 2. 
c, d, Progress curves for protection of individual residues in contact with $4 
and S7; for clarity, only fitted curves are shown (see data in Supplementary 
Fig. 5). The relative saturation (Y) of each protection versus assembly time is 
shown. See Supplementary Methods for definition of RNA-protein contacts. 


spectrometry’” (Supplementary Table 2). Our data indicate that slow 
forming interactions in the rRNA, many of which are indirectly sta- 
bilized by bound proteins, determine the rate of protein addition and 
the hierarchy of assembly. 

This idea is illustrated by interactions of protein S7, which is neces- 
sary for stable binding of other proteins to the 16S 3’ domain**”’. 
Residues indirectly protected by S7 (ref. 23), which overlap the bind- 
ing sites of the secondary assembly proteins $9, $13 and S19, fold 
slowly (Supplementary Fig. 5), explaining why these proteins join 
30S complexes more slowly than S7 (ref. 17). However, residues in 
H34 that directly contact S9 and S10 are fully protected in 50 ms 
(Fig. 3b and Supplementary Fig. 5). Thus, $7, S9 and S10 all engage 
their binding sites in the rRNA early in assembly, but productive 
incorporation of S9 and S10 presumably waits for conformational 
changes in the 16S rRNA induced by S7. An important question is 
whether early stochastic interactions between the ribosomal proteins 
and the rRNA backbone bias the ensemble of rRNA conformations 
towards the native state, thus accelerating self-assembly, or whether 
these interactions delay the search for the native conformation. 

Stable structure in the 16S rRNA allows the concurrent nucleation 
of assembly from many points along the rRNA, resulting in the see- 
mingly chaotic but rapid appearance of native interactions through- 
out the complex. Previous studies indicated that ribosome assembly is 
not completely cooperative’*, demonstrating the need for several nu- 
cleation sites’. The lack of complete cooperativity, and the differences 
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between the time-dependence of 16S folding and the assembly map 
revealed by kinetic footprinting of the nucleotide bases’, support the 
conclusion that assembly proceeds in parallel through intermediates 
with different subsets of proteins®®. In vivo, assembly of the pre-rRNA 
during transcription is more cooperative and involves specific access- 
ory factors®*. Co-transcriptional assembly may simplify the pathway 
by limiting the number of intermediates that can be populated. 
Nonetheless, we expect stochastic fluctuations among alternative 
intermediates to contribute to the early pathway of ribosome assembly 
in vivo. 


METHODS SUMMARY 


Native E. coli 16S rRNA was prefolded in 80 mM K-cacodylate, pH 7.5, 330 mM 
KCI, 20 mM MgCl, for 15 min at 42 °C, then mixed with an equal volume of E. 
coli TP30 in the same buffer at 30 °C, using a Kin-Tek rapid quench apparatus 
fitted with an X-ray flow cell. Between 20% and 40% of the input RNA was 
cleaved after 10 or 20 ms exposure to a white light synchrotron beam (X28C, 
National Synchrotron Light Source (NSLS), Brookhaven National Laboratory). 
The cleaved RNA was analysed by primer extension with reverse transcriptase. 
The increase in relative backbone protection (Y) after the addition of TP30 was 
fitted to rate equations to obtain the observed rate constants and amplitudes for 
the reaction (see Supplementary Fig. 3 and Table 1). Individual protections were 
assigned to specific RNA or RNA-protein interactions on the basis of crystal- 
lographic structures (Protein Data Bank accessions 1j5e (ref. 14), 2avw and 2avy 
(ref. 15)); see Supplementary Methods for details. 
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uch like every other industry, the biotechnology sector is bracing 

itself for a difficult ride in the present financial crisis. That the 

industry has suffered hard times before may give it an advantage, 

but it is still likely to find the squeeze on cash difficult, and the 
knock-on effects for jobs could put some regions at a disadvantage. 

David Mott, a venture capitalist at New Enterprise Associates in Chevy Chase, 
Maryland, describes the current investment landscape for the biotech sector as 
somewhat desolate and dried up. Without ample investment, existing biotech 
companies will be harder to sustain, and new firms will be trickier to set up. As a 
result, it will be harder to generate jobs in the sector. 

But Mott, the former chief executive of MedImmune, a biotech company bought 
by AstraZeneca in April 2007 for $15.6 billion, suggests that the industry is well 
prepared to survive — if only because it has suffered already. The last big biotech 
liquidity crunch was in the early and mid 1990s. “There was very little capital 
available to build even very good companies with very good management teams,” 
says Mott. That means many people in the sector have already been through dry 
spells in the capital markets, he notes. “That experience will come in handy.” 

Perhaps more importantly, the biotech sector has already adapted to a low- 
liquidity environment. Financing models for the area were forced to evolve after 
the technology bubble burst in 2000 and public offerings more or less dried 
up. Biotech investors and founders are no longer attempting to build the next 
Amgen or Genentech. Instead, they are building companies to sell, usually hoping 
big pharmaceutical firms will snap them up. This, says Mott, makes many of 
today's biotech concerns ‘virtual businesses’ — they are increasingly outsourcing 
chemistry and clinical-trials management to places such as India. 

The deepening economic crisis is likely to accentuate this trend. So it could be 
good news for Indian scientists who are seeking biotech-related jobs, but it may be 
less welcome for those in the United States and Europe. 


Gene Russo is editor of Naturejobs. 


CONTACTS 
Editor: Gene Russo 


European Head Office, London 

The Macmillan Building, 

4 Crinan Street, London N1 9XW, UK 
Tel: +44 (0) 20 7843 4961 

Fax: +44 (0) 20 7843 4996 

e-mail: naturejobs@nature.com 


European Sales Manager: 

Andy Douglas (4975) 

e-mail: a.douglas@nature.com 
Natureevents: 

Ghizlaine Ababou (+44 (0) 20 7014 4015) 
e-mail: g. ababou@nature.com 

UK Corporate: 

Nils Moeller (4953) 

Southwest UK/RoW: 

Alexander Ranken (4944) 

Northeast UK/Ireland: 

Matthew Ward (+44 (0) 20 7014 4059) 


France/Switzerland/Belgium: 

Muriel Lestringuez (4994) 
Scandinavia/Spain/Portugal/Italy: 
Evelina Rubio-Hakansson (4973) 

North Germany/The Netherlands/Eastern 
Europe: Reya Silao (4970) 

South Germany/Austria: 

Hildi Rowland (+44 (0) 20 7014 4084) 


Advertising Production Manager: 

Stephen Russell 

To send materials use London address above. 
Tel: +44 (0) 20 7843 4816 

Fax: +44 (0) 20 7843 4996 

e-mail: naturejobs@nature.com 

Naturejobs web development: Tom Hancock 
Naturejobs online production: Dennis Chu 


US Head Office, New York 
75 Varick Street, 9th Floor, 
New York, NY 10013-1917 
Tel: +1 800 989 7718 


Fax: +1 800 989 7103 
e-mail: naturejobs@natureny.com 


US Sales Manager: Peter Bless 


India 
Vikas Chawla (+91 1242881057) 
e-mail: v.chawla@nature.com 


Japan Head Office, Tokyo 

Chiyoda Building, 2-37 Ichigayatamachi, 
Shinjuku-ku, Tokyo 162-0843 

Tel: +813 3267 8751 

Fax: +813 3267 8746 


Asia-Pacific Sales Manager: 

Ayako Watanabe (+813 3267 8765) 
e-mail: a.watanabe@natureasia.com 
Business Development Manager, Greater 
China/Singapore: 

Gloria To (+852 2811 7191) 

e-mail: g.to@natureasia.com 


1273 


© 2008 Macmillan Publishers Limited. All rights reserved 


REGIONS 


Despite its diminutive size and modest reseai 
a major contribution to research in the physi¢ 


wenty-nine year old American astronomer 

Ryan Quadri was at a point in his career 

when he found the idea of living abroad 

appealing. After earning his PhD from Yale 
University last year, Quadri, who studies the formation 
of galaxies in the early Universe, considered various 
destinations. He preferred Europe, and took advantage 
of a connection he had with a Dutch astronomer. 
“Living in another country was my main motivation,’ 
he says. “But I did know, of course, that Leiden is a 
great place for astronomy.’ So last November, Quadri 
relocated to Leiden, a lively university town an houtr’s 
train ride south of Amsterdam, to do three years of 
postdoctoral research. He has no regrets. 

Quadri says it was surprisingly easy to get used to 
working and living in the Netherlands, a small place 
with strong scientific hubs in cities such as Leiden and 
Utrecht. The country has long been known for a 
welcoming attitude towards research and a history of 
encouraging scientific discourse. Scientists there have 
learned to forge international collaborations rather 
than be confined by the resources of a small country. 
Quadri also found that the lab culture was very similar 
to the one he had left behind. 

The Netherlands boasts significant achievements in 
astronomy and other physical sciences, given its size. In 
August, researchers at Dutch institutions won nine out 
of 105 advanced grants in the physical sciences from 
the European Research Council (ERC), making the 
Netherlands the third most successful nation — behind 
Britain and France and on par with Italy and 
Switzerland — in this round of the competition. 
Researchers in Germany, which is five times more 
populous than the Netherlands, have won only seven of 
these prestigious new grants. 

“When I moved to the Netherlands in 2001, I was 
really impressed by the general quality of physics in this 
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country,’ says Michel Orrit, a French condensed- 
matter physicist at the University of Leiden, who has 
won an €1.8-million (US$2.4-million) ERC grant for 
his work on single-molecule spectroscopy. 

And yet the Netherlands’ overall spending on 
science, some 1.8% of gross domestic product, is far 
below that of its eastern neighbour and even slightly 
below the European Union (EU) average. 

“Call it the Dutch paradox,” says Peter Nijkamp, 
chairman of the Netherlands Organisation for 
Scientific Research (NWO), the main funding agency. 
Nijkamp suggests that the Netherlands has managed to 
create very favourable conditions for science despite 
“regrettably modest public research budgets” 


J. HICKS/CORBIS 


Acity of science 

Leiden is one of the country’s centres for physical 
sciences. With a population of 117,000, it lies in the 
Holland region, a highly urbanized ring in the west of 
country and one of the most densely populated areas 
of the world. An old windmill, quiet canals, flocks of 
cyclists and the odd coffee shop remind one that this is 
the land of Vincent van Gogh, cyclist Joop Zoetemelk 
and pragmatic cannabis policies. 

But just below Leiden’s tourist-pleasing surface is a 
rich and active tradition of science. The Leiden 
Bioscience Park is one of the oldest and largest in 
Europe. The university, where the seventeenth-century 
astronomer Christiaan Huygens studied law and 
mathematics, also hosts the country’s largest and 
best-equipped astronomy department. Most research 
takes place in international collaboration, thus 
affording privileged access to telescopes, data and 
computing facilities. And thanks to a generous grant 
from the Netherlands Research School for Astronomy, 
Quadri can also travel to conferences and observatory 
sites around the world as often as necessary. 
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Indeed, astronomy is one of the country’s prime 
science strengths. Dutch space researchers often work 
with international colleagues, while trying to maintain 
a strong domestic base for training students and young 
scientists. In the past 20 years, more PhD students from 
Leiden have been selected for NASA%S prestigious 
Hubble postdoctoral fellowship programme than from 
any other university outside the United States. 

“Astronomy is a very attractive field for young 
physicists,’ says Quadri’s original contact Marijn Franx, 
a group leader at Leiden University who was awarded a 
Hubble fellowship in 1991. Franx notes that former 
students have found positions in fields as diverse as 
high-energy physics, software development, publishing, 
funding agencies and even the diplomatic service. In 
August, Franx received a€1.5-million ERC advanced 
grant for his research on how distant galaxies come into 
being. He plans to build up a new group comprising a 
tenure-track faculty member, two postdocs and two 
PhD students, to extend his work on high-redshift 
galaxies by looking even further back in time 


Critical evaluation 

The key to success is minimal bureaucracy and an 
efficient, highly competitive science-funding system, 
says Nijkamp. Targeted support mechanisms, for both 
early-stage and senior researchers, include the NWO’s 
Rubicon grants for postdoctoral scientists and the 
prestigious €1.5-million Spinoza prize for outstanding 
achievement. A strict system of performance 
measurement for scientists and university departments 
helps grant reviewers allocate resources efficiently. 

Starting in the 1980s, Dutch universities developed a 
research and teaching evaluation system involving 
external reviewers. Criteria include publications and 
success in attracting grant money. Those who measure 
up have a much better shot at tenure. Nijkamp says the 
system has been a big help in improving the quality of 
Dutch science, although there are winners and losers. 
“Our scientists have learned that they will be strictly 
evaluated, and that their success in getting funded 
depends only on their performance,” says Nijkamp. 

All NWO programmes are heavily oversubscribed, 
with grant success rates in some fields as low as 10%. 
The tough competition, says Nijkamp, does create 
bottlenecks. “We often have to say ‘no’ just because the 
money isnt there,” he says. “That's a pity, because it 
means we're missing out on too many good people.” 

But tight national research budgets have not 
diminished the Netherlands’ attractiveness as a host fr 
young scientists from abroad. Terms of employment 
for PhD students are particularly appealing. They are 
hired on a full-time basis for four years — rather than 
for three years as in most dher European countries, 
where they often have only student status. And they get 
paid unusually well: in their fourth year, they can 
expect a salary of €3,000 a month before tax. 

The extra year gives PhD students more time to 
transform their research into papers, says Jaap 
Sinninghe Damsté, a molecular palaeontologist at the 
Royal Netherlands Institute for Sea Research (NIOZ) 
on the Friesian island of Texel. A good publication 
record early on helps those who want to stay in research 
find attractive postdoctoral positions, he says. 

Not many young scientists have luck founding new 
academic labs, however. Sinninghe Damsté, who has 
won an ERC advanced grant for his method of using 


Re 


Dutch involvement in physics: (clockwise from left) helping to build the TEXTOR plasma 


reactor, equipment at the National Institute for Subatomic Physics, and working ona 
component for the Large Hadron Collider. 


Jung-Hyun Kim (top) and 
Ryan Quadri have both 
enjoyed postdocs in the 
Netherlands. 


fossil molecules in sediments to reconstruct past 
climate, is a rare example of a successful Dutch scientist 
who has never worked outside his native country. With 
the €2.5-million grant he will build up a new group 

in organic biogeochemistry, comprising three PhD 
students, two postdocs and a technician. For the first 
time in many years, the NIOZ — one of ten research 
institutes run by the NWO — will also create two or 
three new tenure-track positions. 

Generally, however, a young scientist’s prospects of 
finding permanent positions there are bleak. “To try to 
become a professional scientist here you really have to 
love research very much,’ says Sinninghe Damsté. 

“It's hard,’ agrees his postdoc Jung-Hyun Kim. A 
marine geologist from South Korea, Kim did a postdoc 
in Bremen, Germany, before moving on to the NIOZ. 
She came to the Netherlands on an EU-funded Marie 
Curie fellowship. Despite the challenges, she says, the 
combination of an open, multicultural society and high 
standards in science make the country an ideal training 
ground for young researchers. 

Like some other countries, such as Germany, most 
science jobs in the Netherlands are occupied by 
postdocs on fixed-term contracts. Kim is therefore 
putting out feelers towards France, where her husband 
lives and where she hopes it will be easier to find a 
permanent position. Her ideal is an ERC starting grant 
that would help her to build up an independent 
research team of her own. 

When his contract in Leiden expires, Quadri, too, 
plans to leave the Netherlands and head back to the 
United States. Despite its success in atracting foreign 
talent, the Netherlands often remains a stopping-off 
place for young scientists early in their careers. 

Even so, Dutch science as a whole benefits from the 
flow of international talent — and those who go there 
benefit from a vibrant and welcoming research 
community. “I got so into astronomy here,’ says 


Quadri, “that all other alternatives disappeared” a 
Quirin Schiermeier is Nature's Germany 
correspondent. 
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Colin Lonsdale, director, Haystack Observatory, 
Massachusetts Institute of Technology, 
Westford, Massachusetts 


2006-08: Assistant director, 
Haystack Observatory, 
Massachusetts Institute of 
Technology 

1986-2008: Research 
scientist, Haystack 
Observatory, Massachusetts 
Institute of Technology 
1983-86: Research 
associate, Pennsylvania State 
University, University Park, 
Pennsylvania 


Colin Lonsdale says radio astronomy is experiencing a 
revolution. The new director of the Haystack Observatory 
in Westford, part of the Massachusetts Institute of 
Technology (MIT), believes that new instruments will 
probe unexplored parts of the distant Universe and allow 
scientists to answer crucial questions about how stars and 
galaxies first formed and continue to grow. 

This is the second such revolution for Lonsdale. When 
working on his PhD in the early 1980s, he demonstrated 
how energy is transported in extragalactic radio sources 
by studying radio jets and hotspots, using the then-new 
MERLIN array run from Jodrell Bank Observatory at the 
University of Manchester, UK. He calls his decision to 
pursue a PhD there “career-defining”. 

As apostdoc at Pennsylvania State University, Lonsdale 
used interferometry techniques to study all aspects of 
extragalactic radio sources. His expertise in data-reduction 
software led him to a research scientist position at the 
Haystack Observatory. “That's when things took off for 
me,” says Lonsdale, who helped to detect an unusual radio 
signal from a complex, ultraluminous infrared galaxy 
created by the explosions of 50 supernovae. 

That discovery, made together with his astronomer 
sister Carol, stunned colleagues. It not only yielded the 
first images of many simultaneously exploding stars inside 
a galaxy, but also altered the prevailing theory of how 
galaxies generate intense radio sources called masers, 
according to long-time colleague Phil Diamond, director of 
the Jodrell Bank Centre for Astrophysics. Diamond says it 
took Lonsdale’s imagination and tenacity to demonstrate 
that these galaxies are starbursts. 

In 2000, Lonsdale turned away from pure research, 
instead devoting his time to developing the Murchison 
Widefield Array (MWA), an array of low-frequency radio 
telescopes in Western Australia. He says the MWA, 
scheduled to begin operation in 2010, will target questions 
suchas how to measure solar storms accurately. 

Lonsdale says advances in digital electronics have 
opened up a new frontier in radio science. Diamond fully 
expects that Lonsdale, who got funding for the MWA in 
difficult times, will use his expertise to help position MIT's 
involvement in the next generation of radio-telescope 
technology — the Square Kilometre Array. Diamond is 
optimistic about the effect Lonsdale can have on this 
project, which he calls “the big daddy of radio astronomy". m 
Virginia Gewin 
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Sustenance for sustainability 


Scientists who seek intensely 
interdisciplinary study could be 
the beneficiaries of increasing 
interest in the emerging field of 
sustainability research, with new 
university programmes offering 
novel opportunities. Portland State 
University in Oregon and Cornell 
University in Ithaca, New York, are the 
most recent entrants to the field. They 
follow the example set by institutions 
such as the School of Sustainability at 
Arizona State University in Tempe. 

Broadly defined, sustainability 
bridges disciplines to determine how 
to meet the resource needs of the 
present without adversely affecting 
future generations. In practice, 
that means assembling teams of 
ecologists, economists, biologists and 
social scientists to find solutions and 
strategies for big problems, such as 
meeting future energy needs. 

Portland has received US$25 million 
from the James F. and Marion L. Miller 
Foundation towards sustainability 
research and teaching, and Cornell's 
$10-million National Science 
Foundation (NSF) grant will be used 
to develop the field of ‘computational 
sustainability’ — using computer- 
science techniques to manage natural 
resources more effectively. 

Portland's provost, Roy Koch, 
says their approach is to connect 


curiosity-driven research to short- 
term goals. Within the next 2 years, 
they plan to recruit at least 10 
faculty members, 10-20 graduate 
students and up to 5 postdocs to 
tackle technical issues surrounding 
alternative energy sources, 
sustainable urban communities 
and developing the metrics of 
sustainability. He also hopes to set 
up a visiting faculty programme 

to forge national and international 
connections. 

Cornell's Institute for Computational 
Sustainability involves scientists from 
both Cornell and other US research 
institutions, whose work will involve 
merging disparate data sets related 
to areas such as environmental 
science and economics. One 
example is modelling the evolution 
of fish populations to assist with the 
designation of no-fishing zones. 

The main focus of the NSF grant is 
to train some 12 graduate students 
each year. Cornell also wants to 
attract postdocs who are interested in 
computer science or applied maths. 
Carla Gomes, the institute's director, 
says that “computer scientists aren't 
aware that their expertise can impact 
issues of sustainability, and we hope 
to inject computational thinking into 
problem solving.” |] 
Virginia Gewin 
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Waiting with baboons 


| have found it hard to return to my research in Ethiopia after my six-week 
hiatus. The fear of the unknown is gone, but so is some of the excitement. And 
this time, | have no one from the outside world with me. The scouts and our 
assistant are around, but there is nobody with whom to muse about science 
over a warmish beer. And, as any semi-sane person can tell you, having too 


much time for your own thoughts can be utterly destructive. 

For the first few days here, | ranted and raved silently, even finding it difficult 
to go outdoors. It was too much ‘reality’ hitting me — I'm here and I'm more or 
less alone. | considered the gelada baboons my only distraction, and negativity 
threatened to devour me. | had to try to see the positive aspects of this 
experience. After all, | want to be here and | love my job. How can | deal with the 
circumstances surrounding it? 

| came to see that | have been given something that | never had in the city: 
time. It is a precious thing, and not all that frightening. Now | am filling my time 
with the things | never get to do back home — reading articles, doing extra 
analyses, working overtime and writing letters. And |'m finding it a positive 
experience. | still love all social distractions, but | don’t constantly bemoan my 
solitude. Only my attitude has changed. Perhaps that’s enough to carry me 
through. r | 
Aliza le Roux is a postdoctoral fellow in animal behaviour at the University of Michigan. 
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me,” says Lonsdale, who helped to detect an unusual radio 
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created by the explosions of 50 supernovae. 
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Scientists who seek intensely 
interdisciplinary study could be 
the beneficiaries of increasing 
interest in the emerging field of 
sustainability research, with new 
university programmes offering 
novel opportunities. Portland State 
University in Oregon and Cornell 
University in Ithaca, New York, are the 
most recent entrants to the field. They 
follow the example set by institutions 
such as the School of Sustainability at 
Arizona State University in Tempe. 

Broadly defined, sustainability 
bridges disciplines to determine how 
to meet the resource needs of the 
present without adversely affecting 
future generations. In practice, 
that means assembling teams of 
ecologists, economists, biologists and 
social scientists to find solutions and 
strategies for big problems, such as 
meeting future energy needs. 

Portland has received US$25 million 
from the James F. and Marion L. Miller 
Foundation towards sustainability 
research and teaching, and Cornell's 
$10-million National Science 
Foundation (NSF) grant will be used 
to develop the field of ‘computational 
sustainability’ — using computer- 
science techniques to manage natural 
resources more effectively. 

Portland's provost, Roy Koch, 
says their approach is to connect 


curiosity-driven research to short- 
term goals. Within the next 2 years, 
they plan to recruit at least 10 
faculty members, 10-20 graduate 
students and up to 5 postdocs to 
tackle technical issues surrounding 
alternative energy sources, 
sustainable urban communities 
and developing the metrics of 
sustainability. He also hopes to set 
up a visiting faculty programme 

to forge national and international 
connections. 

Cornell's Institute for Computational 
Sustainability involves scientists from 
both Cornell and other US research 
institutions, whose work will involve 
merging disparate data sets related 
to areas such as environmental 
science and economics. One 
example is modelling the evolution 
of fish populations to assist with the 
designation of no-fishing zones. 

The main focus of the NSF grant is 
to train some 12 graduate students 
each year. Cornell also wants to 
attract postdocs who are interested in 
computer science or applied maths. 
Carla Gomes, the institute's director, 
says that “computer scientists aren't 
aware that their expertise can impact 
issues of sustainability, and we hope 
to inject computational thinking into 
problem solving.” |] 
Virginia Gewin 
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Waiting with baboons 


| have found it hard to return to my research in Ethiopia after my six-week 
hiatus. The fear of the unknown is gone, but so is some of the excitement. And 
this time, | have no one from the outside world with me. The scouts and our 
assistant are around, but there is nobody with whom to muse about science 
over a warmish beer. And, as any semi-sane person can tell you, having too 


much time for your own thoughts can be utterly destructive. 

For the first few days here, | ranted and raved silently, even finding it difficult 
to go outdoors. It was too much ‘reality’ hitting me — I'm here and I'm more or 
less alone. | considered the gelada baboons my only distraction, and negativity 
threatened to devour me. | had to try to see the positive aspects of this 
experience. After all, | want to be here and | love my job. How can | deal with the 
circumstances surrounding it? 

| came to see that | have been given something that | never had in the city: 
time. It is a precious thing, and not all that frightening. Now | am filling my time 
with the things | never get to do back home — reading articles, doing extra 
analyses, working overtime and writing letters. And |'m finding it a positive 
experience. | still love all social distractions, but | don’t constantly bemoan my 
solitude. Only my attitude has changed. Perhaps that’s enough to carry me 
through. r | 
Aliza le Roux is a postdoctoral fellow in animal behaviour at the University of Michigan. 
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Westford, Massachusetts 
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Colin Lonsdale says radio astronomy is experiencing a 
revolution. The new director of the Haystack Observatory 
in Westford, part of the Massachusetts Institute of 
Technology (MIT), believes that new instruments will 
probe unexplored parts of the distant Universe and allow 
scientists to answer crucial questions about how stars and 
galaxies first formed and continue to grow. 

This is the second such revolution for Lonsdale. When 
working on his PhD in the early 1980s, he demonstrated 
how energy is transported in extragalactic radio sources 
by studying radio jets and hotspots, using the then-new 
MERLIN array run from Jodrell Bank Observatory at the 
University of Manchester, UK. He calls his decision to 
pursue a PhD there “career-defining”. 
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extragalactic radio sources. His expertise in data-reduction 
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signal from a complex, ultraluminous infrared galaxy 
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That discovery, made together with his astronomer 
sister Carol, stunned colleagues. It not only yielded the 
first images of many simultaneously exploding stars inside 
a galaxy, but also altered the prevailing theory of how 
galaxies generate intense radio sources called masers, 
according to long-time colleague Phil Diamond, director of 
the Jodrell Bank Centre for Astrophysics. Diamond says it 
took Lonsdale’s imagination and tenacity to demonstrate 
that these galaxies are starbursts. 

In 2000, Lonsdale turned away from pure research, 
instead devoting his time to developing the Murchison 
Widefield Array (MWA), an array of low-frequency radio 
telescopes in Western Australia. He says the MWA, 
scheduled to begin operation in 2010, will target questions 
suchas how to measure solar storms accurately. 

Lonsdale says advances in digital electronics have 
opened up a new frontier in radio science. Diamond fully 
expects that Lonsdale, who got funding for the MWA in 
difficult times, will use his expertise to help position MIT's 
involvement in the next generation of radio-telescope 
technology — the Square Kilometre Array. Diamond is 
optimistic about the effect Lonsdale can have on this 
project, which he calls “the big daddy of radio astronomy". m 
Virginia Gewin 
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Faux-pas, Doc 


Don't toy with affections. 


Janett L. Grady 


I’m in Paris, on my way to Boston. My 
insides are all screwed up and I’m on my 
way to see if I can get things fixed. If I can’t 
get things fixed, the conservative types in 
World Congress will order me taken apart 
and Ill be fed to the hogs, piece by piece. 
I'm three hours early for the transfer and 
I’ve already been scanned and labelled. 'm 
killing time, wandering back and forth, 
when I suddenly spot her, or think it’s her. 
She’s in one of those ancient hoverchairs. 
I wave, catch her eye and she waves back, 
hovers towards me, a quizzical, surprised 
look on her face. 

“Well, well” she says with a smile “What 
a surprise.” 

I smile in return. “Doc,” I say, “it is you. 
My God, you haven't changed at all.” It’s 
almost true. She’s in the chair, a few pounds 
heavier, a little older-looking but the same 
Doc Tan who turned me on back in 3010. 
“How have you been?” J ask. “It’s been 
— what — 20 years?” 

“At least that long,’ she says. She hovers 
back a little and looks me up and down, 
her blue eyes crinkling at the corners. She's 
gorgeous. Even after all these years, I still 
want to give her a great big kiss. “Damn, 
T-3,” she says, “you're looking fine.” She 
sighs and then frowns prettily. “I never 
thought youd last this long,” she says. “Is 
it all still working?” 

“Still working,” I tell her, and hesitate, 
unsure about telling her what's wrong. “I'm 
at the Capitol,” I tell her. “I’ve been there 
for nine years.” 

“Men, women or what?” she asks. 
“You're wired for both, or at least I tried” 
She laughs. “But with all the women in 
World Congress these days, I wouldn't be 
all that surprised if you've been rewired to 
prefer women.” 

“Youre right;’ I tell her. “I still do men, 
but I've been tweaked to prefer women.’ 

“T figured they might do that, she says. 
“But, hey, no big deal, right?” 

“It is a big deal,’ I shoot back. “When 
I do men, it hurts like hell. There’s some- 
thing wrong. I'm on my way to Mass Tech 
to see if I can get it fixed” 

She doesn’t seem interested. She shrugs, 
looks at her watch. “I’ve got a few min- 
utes,” she says. “You want some wine?” 
She doesr’t wait for an answer. She spins 
around, waves for me to follow, and leads 
the way into the French Space. She leaves 
me at a table, goes for the wine and then 
joins me. She sips from her glass, settles 


back in her chair. “You do look a little 
different,” she says, “but hell, don’t we 
all? Me, I’m stuck in this chair” Her 
eyes seem fixed on my blouse. “So 
what’s wrong?” she asks. “You look 
healthy enough.” 

“Tm not,” I tell her. “The penis- 
pocket is dosing shut, not to mention 
being backed up. I'm not passing the 
way I should be.” 

“It’s been a long 
time,” she says. 

“You’ve been 

screwing your 

brains out, and 

youre scratched and swollen. It’s the 
inside sensors. They wear out.” She 
takes my hand, squeezes hard, then 
lets go. “Tell you what,’ she 
says. ‘I'm in the lab at Notre 
Dame. If they don't fix it in 
Boston, come back and let 
me know. I'll take a look” She 
grins a mischievous grin. “If 
worse comes to worse,’ she 
says, “I'll take it all apart 
and start over. Then you 
can stay with me, service 
yours truly for awhile” 

I’m sipping wine, 
biting my lip, trying to 
keep from looking at 
her. I cart stop thinking 
about her being in the chair and I 
don’t want her to see any trace of fear 
on my face. 

“So what's with the chair?” I ask. “If you 
don't mind me asking, Id like to know what 
happened” 

“Nothing much,’ she says. “A transport 
to Moscow didnt quite take and I lost the 
use of my legs. I's no big deal, though, Ive 
still got feeling down there... Youd still be 
able to drive me wild” 

“Td like that, Doc, but...” 

“No buts,’ she says. “I'll clear it with 
Congress, let em know you're going to be 
with me.’ She glances at her watch, says 
she’s got to run. “I'll be looking for you, 
T-3 She smiles that cute little smile of 
hers, waves and keeps waving as she hov- 
ers away. I assume shes hurrying to catch 
a transfer. “Doc, wait, I shout, but she’s 
already gone. 

My own transport doesn’t happen for 
another hour, so I just sit there drinking 
wine. I'm thinking about how Doc is in a 
hoverchair, and figure if she can’t fix her- 
self, how in the hell is she going to fix me. 
True, shes the one who turned me on, but 


© 2008 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 455/30 October 2008 


that was 20 years ago. It’s not that I don't 
want to be at her beck and call, because in 
a way I always have been. But Doc Tan has 
always been well connected with the anti- 
science, anti-sex types in Congress. Besides, 
if Doc is at Notre Dame, a man’s world, it’s 
my guess shell simply take me apart, have 
me fed to the hogs, piece by piece, and start 
over. Im not going to let her do t, no matter 
what. While I'm no longer a perfect toy for 
men, I'm still pretty good when it comes to 
women. I just need fixing, that’s all. 

Sorry, Doc, but I just don’t trust you. 
For one thing, the inside of what you 
first designed and put together is now all 
screwed up. For another, I'm Tan Number 
2, not Tan Number 3. T-2, not T-3. | 
Janett L. Grady is a senior citizen who 
writes from Palmer, Alaska. Her work has 
appeared ina variety of magazines and 
newspapers. This story is her first 
attempt at writing science fiction. 
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